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NOVEL GENES ENCODING WHEAT STARCH SYNTHASES 
AND USES THEREFOR 

FIELD OF THE INVENTION 

5 The present invention relates generally to isolated nucleic acid molecules encoding 
wheat starch synthase enzymes and more particularly, to isolated nucleic acid 
molecules that encode wheat SSII and SSIII enzyme activities. The isolated nucleic 
acid molecules provide the means for modifying starch content and composition in 
plants, for example the ratio of amyloseiamylopectin In the starch granule of the 

10 endosperm during the grain*filling phase of endosperm development. The isolated 
nucleic acid molecules of the present invention also provide the means for screening 
plant lines to determine the presence of natural and/or induced mutations in starch 
synthase genes which affect starch content and/or composition. The isolated nucleic 
acid molecules of the present invention further provide for the screening-assisted 

15 breeding of plants having desirable starch content and/or composition, in addition to 
providing for the direct genetic manipulation of plant starch content and/or composition. 

GENERAL 

Bibliographic details of the publications numerically refeired to in this specification are 
20 collected at the end of the description. 

This specification contains nucleotide and amino acid sequence information (SEQ ID 
Nos:) prepared using the programme Patentin Version 2.0, presented herein at the 
end of the specification. Each nucleotide or amino acid sequence is identified in the 

25 sequence listing by the numeric indicator <210> followed by the sequence identifier 
(e.g. <210>1. <210>2. etc). The length, type of sequence (DMA. protein (PRT), etc) 
and source organism for each nucleotide or amino acid sequence are indicated by 
information provided in the numeric indicator fields <211>, <212> and <213>, 
respectively. Nucleotide and amino acid sequences (SEQ ID NOs:) referred to in the 

30 specification are defined by the information provided in numeric indicator field <400> 



P:\OPER\MRO\PI-WSS.PRV - 29W/99 



-2" 

followed by the sequence identifier (eg. <400>1, <400>2, etc). 

The designation of nucleotide residues referred to herein are those recommended by 
the iUPAC-lUB Biochemical Nomenclature Commission, wherein A represents 
5 Adenine, C represents Cytosine, G represents Guanine, T represents thymine, Y 
represents a pyrimidine residue, R represents a purine residue, M represents Adenine 
or Cytosine, K represents Guanine or Thymine, S represents Guanine or Cytosine, W 
represents Adenine or Thymine, H represents a nucleotide other than Guanine, B 
represents a nucleotide other than Adenine, V represents a nucleotide other than 
10 Thymine, D represents a nucleotide other than Cytosine and N represents any 
nucleotide residue. 

The designations for naturally-occurring amino acid residues referred to herein are set 
forth in Table I. The designations for a non-limiting set of non-naturally-occurring amino 
15 acids is listed in Table 2. 

As used herein the term "derived from" shall be taken to indicate that a specified 
integer may be obtained from a particular source albeit not necessarily directly from 
that source. 

20 

Throughout this specification, unless the context requires otherwise, the word 
"comprise", or variations such as "comprises" or "comprising", will be understood to 
imply the inclusion of a stated step or element or integer or group of steps or elements 
or integers but not the exclusion of any other step or element or integer or group of 
25 steps or elements or integers. 
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TABLE 1 



Amino Acid 


Three-letter Code 


VxllC iwLLwJl V-^Uvlw 


5 Alanine 


Ala 


A 


Arginine 


Arg 


JX 


Asparagine 


Asn 


IN 


Aspartic acid 


Asp 


u 


Cysteine 


Cys 


c 


10 Glutamine 


Gm 


o 


Glutamic acid 


CjrlU 


E 


Glycine 


Gly 




Histidine 


His 




Isoleucine 


He 


T 
X 


15 Leucine 


Leu 


L 


Lysine 


Lys 


K 


Methionine 


Met 


M 


Phenylalanine 


i lie 


F 


Proline 


trVO 


p 


20 Serine 


Ser 


c 


Threonine 


Thr 


T 


Tryptophan 


Trp 


W 


Tyrosine 


Tyr 


Y 


Valine 


Val 


V 


25 Aspartate/glutamate 


Baa 


B 


Asparagine/glutamine 






Any amino acid as above 


Xaa 


X 
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TABLE 2 



Non-conventional 
amino acid 



Code 



Non-conventional 
amino acid 



Code 



a-aminobutyric acid Abu 
a-amino-a-methylbutyrate Mgabu 

aminocyclopropane- Cpro 
carboxylate 

10 aminoisobutyric acid Aib 

aminonorbomyl- Norb 
carboxylate 

cyclohexylalanine Chexa 

cyclopentylalanine Cpen 

15 D-alanine Dal 

D-arginine Darg 

D-aspartic acid Dasp 

D-cysteine Dcys 

D-glutamine Dgln 

20 D-glutamic acid Dglu 

D-histidine Dhis 

D-isoleucine Dile 

D-leucine Dleu 

D-lysine Dlys 

25 D-methionine Dmet 

D-omithine Dorn 

D-phenylalanine Dphe 

D-proline Dpro 

D-serine Dser 

30 D-threonine Dthr 



L-N-methylalanine Nmala 

L-N-methylarginine Nmarg 

L-N-methylasparagine Nmasn 

L-N-methylaspartic acid Nmasp 

L-N-methylcysteine Nmcys 

L-N-methylglutamine Nmgln 

L-N-methylglutamic acid Nmglu 

L-N-methylhistidine Nmhis 

L-N-methylisoUeucine Nmile 

L-N-methylleucine Nmleu 

L-N-methyllysine Nmlys 

L-N-methylmethionine Nmmet 

L-N-methylnorleucine Nmnle 

L-N-methylnorvaline Nmnva 

L-N-methylomithine Nmom 

L-N-methylphenylalanine Nmphe 

L-N-methylproline Nmpro 

L-N-methylserine Nmser 

L-N-methylthreonine Nmthr 

L-N-methyltryptophan Nmtrp 

L-N-methyltyrosine Nmtyr 

L-N-methylvaline Nmval 

L-N-methylethylglycine Nmetg 

L-N-methyl-t-butylglycine Nmtbug 

L-norleucine Nle 
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D-tryptophan 


Dtrp 


L-norvaline 


Nva 




D-tyrosine 


Dtyr 


a-methyl-aminoisobutyrate 


Maib 




D-valine 


Dval 


a-methyl-Y-aminobutyrate 


Mgabu 




D-a-methylalanine 


Dmala 


a-methylcyclohexylalanine 


Mchexa 


5 


D-a-methylarginine 


Dmarg 


a-methylcylcopentylalanine 


Mcpen 




D-a-methylasparagine 


Dmasn 


a-methyl-a-napthylalanine 


Manap 




D-a-methylaspartate 


Dmasp 


a-methylpenicillamine 


Mpen 




D-a-methylcysteine 


Dmcys 


N-(4-aniinobutyl)glycine 


Nglu 




D-a-methylglutamine 


Dmgln 


N-(2-aminoethyl)glycme 


Naeg 


10 D-a-methylhistidine 


Dmhis 


N-(3-aminopropyl)glycine 


Norn 




D-a-methylisoleucine 


Dmile 


N-amino-a-methylbutyrate 


Nmaabu 




D-a-methylleucine 


Dmleu 


a-napthylalanine 


Anap 




D-a-methyllysine 


Dmlys 


N-benzylglycine 


Nphe 




D-a-methylmethionine 


Dmmet 


N-(2-carbamylethyl)glycine 


Ngln 


15 


D-a-methylornithine 


Dmorn 


N-(carbamylmethyl)glycine 


Nasn 




D-a-methylphenylalanine 


Dmphe 


N-(2-carboxyethyl)glycine 


Nglu 




D-a-methylproline 


Dmpro 


N-(carboxymethyl)glycine 


Nasp 




D-a-methylserine 


Dmser 


N-cyclobutylglycine 


Ncbut 




D-a-methylthreonine 


Dmthr 


N-cycloheptylglycine 


Nchep 


20 D-a-methyltryptophan 


Dmtrp 


N-cyclohexylglycine 


Nchex 




D-a-methyltyrosine 


Dmty 


N-cyclodecylglycine 


Ncdec 




D-a-methylvaline 


Dmval 


N-cylcododecylglycine 


Ncdod 




D-N-methylalanine 


Dnmala 


N-cyclooctylglycine 


Ncoct 




D-N-methylarginine 


Dnmarg 


N-cyclopropylglycine 


Ncpro 


25 


D-N-methylasparagine 


Dnmasn 


N-cycloundecylglycine 


Ncund 




D-N-methylaspartate 


Dnmasp 


N-(2,2-diphenylethyl) 
glycine 


Nbhm 




D-N-methylcysteine 


Dnmcys 


N-(3 ,3-diphenylpropyl) 
glycine 


Nbhe 



30 
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D-N-methylglutamine 


Dnmgln 


N-(3-guanidinopropyl) 








glycine 


Narg 


D-N-methylglutamate 


Dumglu 


N-( 1 -hydroxy ethyOgly cine 


Nthr 


D-N-methylhistidine 


Dnmhis 


N-(hydroxyethyl))glycine 


Nser 


5 D-N-methylisoleucine 


Dnmile 


N-(imidazolylethyl)) 








glycine 


Nhis 


D-N-methylleucine 


Dnmleu 


N-(3-indolylyethyl) 








glycine 


Nhtrp 


D-N-methyllysine 


Dnmlys 


N-methyl-y-aminobutyrate 


Nmgabu 


10 N-methylcyclohexylalanine 


Nmchexa 


D-N-methylmethionine 


Dnmmet 


D-N-methylomithine 


Dnmom 


N-methylcyclopentylalanine 


Nmcpen 


N-methylglycine 


Nala 


D-N-methylphenylalanine 


Dnmphe 


N-methylaminoisobutyrate 


Nmaib 


D-N-methylproline 


Dnmpro 


N-(l -methylpropy Oglycine 


Nile 


D-N-n[iethylserine 


Dnmser 


15 N-(2-methyIpropyl)glycine 


Nleu 


D-N-methylthreonine 


Dnmthr 


D-N-methyltryptophan 


Dnmtrp 


N-( 1 -methylethy l)glycine 


Nval 


D-N-methyltyrosine 


Dnmtyr 


N-methyla-napthylalanine 


Nmanap 


D-N-methylvaline 


Dnmval 


N-methylpenicillamine 


Nmpen 


Y-aminobutyric acid 


Gabu 


N-(p-hydroxyphenyl)glycine 


Nhtyr 


20 L-r-butylglycine 


Tbug 


N-(thiomethyl)glycine 


Ncys 


L-ethylglycine 


Etg 


penicillamine 


Pen 


L-homophenylalanine 


Hphe 


L-a-methylalanine 


Mala 


L-a-methylarginine 


Marg 


L-a-methylasparagine 


Masn 


L-a-methylaspartate 


Masp 


L-a-methyl-r-butylglycine 


Mtbug 


25 L-a-methylcysteine 


Mcys 


L-methylethylglycine 


Metg 


L-a-methylglutamine 


Mgln 


L-a-methylglutamate 


Mglu 


L-a-methylhistidine 


Mhis 


L-a-methylhomo 








phenylalanine 


Mhphe 


L-a-methylisoleucine 


Mile 


N-(2-methylthioethyl) 




30 




glycine 


Nmet 
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L-a-methylleucine Mleu 

L-a-methylmethionine Mmet 

L-a-methylnorvaline Mnva 

L-a-methylphenylalanine Mphe 

5 L-a-methylserine Mser 

L-a-methyltryptophan Mtrp 

L-a-methylvaline Mval 

N-(N-(2,2-diphenylethyl) 

10 carbamylmethyOglycine Nnbhm 
1 -carboxy- 1 -(2 ,2-dipheny 1- 

ethylamino)cyclopropane Nmbc 



L-a-methyllysine NJlys 

L-a-methylnorleucine Mnle 

L-a-methylornithine Mom 

L-a-methylproline Mpro 

L-a-methylthreonine Mthr 

L-a-methyltyrosine Mtyr 
L-N-methylhomo 

phenylalanine Nmhphe 
N-(N-(3 , 3-dipheny Ipropy 1) 

carbamylmethyl)glycine Nnbhe 



15 Those skilled in the art will appreciate that the invention described herein is susceptible 
to variations and modifications other than those specifically described. It is to be 
understood that the invention includes all such variations and modifications. The 
invention also includes all of the steps, features, compositions and compounds 
referred to or indicated in this specification, individually or collectively, and any and all 

20 combinations or any two or more of said steps or features. 

The present invention is not to be limited in scope by the specific embodiments 
described herein, which are intended for the purposes of exemplification only. 
Functionally-equivalent products, compositions and methods are clearly within the 
25 scope of the invention, as described herein. 
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BACKGROUND TO THE INVENTION 

The biosynthesis of the starch granule is a complex process which involves the action 
of an array of isoforms of enzymes involved in the starch biosynthesis. Following the 
formation of glucose-1 -phosphate, the enzyme activities required for the synthesis of 
5 granular starch include ADP glucose pyrophosphorylase (EC 2.7.7.27), starch 
synthases (EC 2.4.1.21), branching enzymes (EC 2.4.1.18) and debranching enzymes 
(EC 3.2.1.41 and EC 3.2.1.68) (Mouille efa/., 1996). Plants contain isozymes of each 
of these activities, and the definition of these isoforms and their roles has been 
conducted through investigation of the properties of the suite of soluble enzymes found 
10 in the stroma of the plastid, analysis of the proteins entrapped within the matrix of the 
starch granule, and mutational studies to identify genes and define linkages between 
individual genes and their specific roles. 

Starch synthases extend regions of a-1,4 glucan through the transfer of the glucosyl 
15 moiety of ADPglucose to the non-reducing end of a pre-existing a-1,4 glucan. In 
addition to GBSS, 3 other classes of starch synthase have been identified in plants, 
SSI (wheat, Li et aL, 1999 and GenBank Accession No. U48227; rice, Baba et aL, 
1993; potato, Genbank Accession No. STSTASYNT). SSII (pea, Dry et aL 1992; 
potato, Edwards et aL, 1995; maize, Harn et aL 1998 and GenBank Accession No. 
20 U66377) and SSIII (potato. Abel etaL, 1996; maize. Gao ef a/.. 1998). In the cereals, 
the most comprehensively studied species is maize, where in addition to GBSS, 
cDNAs encoding SSI, SSIIa. and SSIIb have been isolated, and both cDNA and 
genomic clones for duin have been characterised (Knight et aL, 1998; Harn et aL, 
1998; Gao et aL, 1998). In maize, the product of the du1 gene is known as maize 
25 SSII, however this gene is the homologue of potato SSIII. 

The proteins within the matrix of the wheat starch granule have been extensively 
studied (Denyer ef a/., 1995; Rahman ef a/., 1995; Takaoka ef aA, 1997; Yamamori 
and Endo. 1996) and 60. 75. 85, 100. 104 and 105 kDa protein bands can be 
30 visualised following SDS-PAGE. The predominant 60 kDa protein is exclusively 
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granule-bound and is analogous to the "waxy" granule bound starch synthase (GBSS) 
gene in maize (Rahman et al., 1995). The combination of three null alleles for this 
enzyme from each of the wheat genomes (Nakamura ef a/., 1995) results in the 
amylose-free "waxy" phenotype found in other species The 75 kDa starch synthase I 
5 (wSSI) is found in both the granule and the soluble fraction of wheat endosperm 
(Denyer ef a/., 1995; Li ef a/., 1999) and has been assigned to chromosomes 7A, 7B 
and 7D (Yamamori and Endo, 1996; Li et al., 1999). The 85 KDa band contains a 
class II branching enzyme and an unidentified polypeptide (Rahman ef a/., 1995). The 
100, 104 and 105 kDa proteins of the wheat starch granule (designated Sgp-BI, Sgp- 
10 D1 and Sgp-AI by Yamamori and Endo, 1996) have been shown to be encoded by a 
homeologous set of genes on the short arm of chromosome 78, 7A and 7D 
respectively (Yamamori and Endo, 1996; Takaoka ef a/., 1997). Denyer ef a/. (1995) 
concluded on the basis of enzyme activity assays that these proteins were also starch 
synthases. These genes are referred to hereinafter as the "wheat SSI I genes". 

15 

While GBSS has been established to be essential for amylose synthesis, the remaining 
starch synthases are thought to be primarily responsible for the elongation of 
amylopectin chains, although this does not preclude them from also having non- 
essential roles in amylose biosynthesis. Differences in kinetic properties between 

20 isoforms, and the analysis of mutants lacking various isoforms, suggests that each 
isoenzyme contributes to the extension of specific subsets of the available non- 
reducing ends. Accordingly, the production of plants that produce improved starches 
that are modified for particular purposes, for example starches having high or low 
amylose:amylopectin ratios, requires the availability of genes encoding the various 

25 starch synthase isoforms. Moreover, because of species-specific codon usages and 
variations in the kinetic parameters of these isoforms in different species, the 
production of modified starches may require the use of genes derived from particular 
species. 

30 In work leading up to the present invention, the inventors sought to modify wheat 
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starch composition and content, by providing isolated nucleotide sequences encoding 
the wheat SSI! (i.e. wSSII) and wheat SSIII (i.e. wSSIII) isoenzymes, and by 
Introducing these nucleotide sequences into plants using recombinant DNA 
technology. 

5 

SUMMARY OF THE INVENTION 

The present invention provides isolated nucleic acid molecules encoding the 100, 104 
and 105 kDa SSIl (Sgp-1) polypeptides of the wheat starch granule matrix, as 
determined using the SDS/PAGE system of Rahman et al. (1995), which polypeptides 

10 are equivalent to the 100, 108 and 115 kDa polypeptides described by Yamamori and 
Endo (1996). The present invention further provides isolated nucleic acid molecules 
encoding the soluble dulh-t/pe wheat starch synthase III polypeptide. Analysis of the 
polypeptides encoded by these nucleic acid molecules reveals several consensus 
amino acid sequence motifs (i.e., sequences having at least 25% sequence identity 

15 to any one or more of the amino acid sequences selected from the group consisting 
of (a)KVGGLGDWTS;(b)GHTVEVILPKY;(c) HDWSSAPVAWLYKEHY; (d) 
GILNGIDPDIWDPYTD; (e) DVPIVGIITRLTAQKG; (f)NGQWLLGSA; 
(g)AGSDFIIVPSIFEPCGLTQLVAMRYGS; and (h)TGGLVDTV ) that are highly 
conserved in wheat starch synthase isoenzymes, in addition to isoenzyme-specific 

20 sequences, which sequences possess utility in isolating related starch synthase- 
encoding sequences and in assaying plants for their expression of one or more starch 
synthase isoenzymes. 

Accordingly, one aspect of the present invention provides an isolated nucleic acid 
25 molecule which comprises a sequence of nucleotides which encodes, or is 
complementary to a nucleic acid molecule which encodes a wheat starch synthase 
polypeptide, protein or enzyme molecule or a functional subunit thereof selected from 
the following: 

(i) a wheat starch synthase II (wSSII) polypeptide, protein or enzyme or 
30 functional subunIt thereof which comprises an amino acid sequence which is at 
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least about 85% identical overall to an amino acid sequence set forth in any one 
of SEQ ID NOS: <400>2, <400>4. or <400>6; 

(ii) a wheat starch synthase III (wSSIII) polypeptide, protein or enzyme or 
functional subunit thereof which comprises an amino acid sequence which is at 

5 least about 85% identical overall to an amino acid sequence set forth in any one 

of SEQ ID NOS: <400>8 or <400>10;and 

(iii) a wheat starch synthase polypeptide, protein or enzyme or functional 
subunit thereof which comprises a conserved amino acid sequence having at 
least 25% identity to an amino acid sequence selected from the group 

10 consisting of: 

(a) KVGGLGDWTS; 

(b) GHTVEVILPKY: 

(c) HDWSSAPVAWLYKEHY: 

(d) GILNGIDPDIWDPYTD; 
15 (e) DVPIVGIITRLTAQKG; 

(f) NGQWLLGSA; 

(g) AGSDFIIVPSIFEPCGLTQLVAMRYGS; and 

(h) TGGLVDTV 

and wherein said wheat starch synthase polypeptide further comprises an 
20 amino acid sequence having at least about 85% identity overall to an amino 

acid sequence set forth in any one of SEQ ID NOS: <400>2, <400>4, <400>6, 
<400>8 or <400>10. 



In a preferred embodiment, the isolated nucleic acid molecule encodes a starch 
25 synthase polypeptide, protein or enzyme having at least about 90% amino acid 
sequence identity to any one of SEQ ID NOS:<400>2, <400>4, <400>6, <400>8 or 
<400>10, more preferably having at least about 95% or about 97% or about 99% 
identity to any one of said amino acid sequences. 

30 In an alternative embodiment, the present invention provides an isolated nucleic acid 



« « 
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molecule which encodes a wheat starch synthase polypeptide, protein or enzyme 
molecule or a functional subunit thereof, wherein said nucleic acid molecule comprises 
a nucleotide sequence having at least about 85% nucleotide sequence identity to any 
one of SEQ ID NOS: <400>1. <400>3, <400>5, <400>7 or <400>9 or SEQ ID NOS: 
5 <400>1 1 to <400>16, or a complementary nucleotide sequence thereto. 

In a preferred embodiment, the isolated nucleic acid molecule comprises the 
nucleotide sequence set forth in any one of SEQ ID NOS: <400>1, <400>3, <400>5, 
<400>7 or <400>9 or SEQ ID NOS: <400>11 to <400>16, or is at least about 90% 
10 identical, more preferably at least about 95% or 97% or 99% identical to all or a 
protein-encoding part thereof. 

In an alternative embodiment, the present invention provides an isolated nucleic acid 
molecule which encodes a wheat starch synthase polypeptide, protein or enzyme 
15 molecule or a functional subunit thereof, wherein said nucleic acid molecule comprises 
a nucleotide sequence that is capable of hybridising under at least moderate 
stringency hybridisation conditions to at least about 30 contiguous nucleotides derived 
from any one of SEQ ID NOS: <400>1, <400>3, <400>5, <400>7 or <400>9 or SEQ 
ID NOS: <400>11 to <400>16, or a complementary nucleotide sequence thereto. 

20 

A second aspect of the present invention provides a method of isolating a nucleic acid 
molecule that encodes a starch synthase polypeptide, protein or enzyme having at 
least about 85% amino acid sequence identity to any one SEQ ID NOS:<400>2, 
<400>4, <400>6, <400>8 or <400> 10 and/or which comprises anamino acid sequence 
25 having at least 25% identity to an amino acid sequence selected from the group 
consisting of: 

(a) KVGGLGDWTS; 

(b) GHTVEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 
30 (d) GILNGIDPDIWDPYTD; 
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(e) DVPIVGIITRLTAQKG; 

(f) NGQWLLGSA; 

(g) AGSDFIIVPSIFEPCGLTQLVAMRYGS; and 

(h) TGGLVDTV , 

5 said method comprising: 

(I) hybridising a probe or primer comprising at least about 15 contiguous 
nucleotides in length derived from any one of SEQ ID NOS: <400>1 , <400>3. 
<400>5. <400>7 or <400>9 or SEQ ID NOS: <400>11 to <400>16, or a 
complementary nucleotide sequence thereto to single-stranded or double- 

10 stranded mRNA, cDNA or genomic DNA; and 

(II) detecting the hybridised mRNA, cDNA or genomic DNA using a detecting 
means. 

Preferably, the detecting means is a reporter molecule covalently attached to the probe 
15 or primer molecule or alternatively, a polymerase chain reaction format. Accordingly, 
the present invention clearly extends to the use of the nucleic acid molecules provided 
herein to isolate related starch synthase-encoding sequences using standard 
hybridisation and/or polymerase chain reaction techniques. 

20 A third aspect of the invention provides an isolated probe or primer comprising at least 
about 15 contiguous nucleotides in length derived from any one of SEQ ID NOS: 
<400>1, <400>3, <400>5, <400>7 or <400>9 or SEQ ID NOS: <400>1 1 to <400>16, 
or a complementary nucleotide sequence thereto. 

25 Preferably, the probe or primer comprises a nucleotide sequence set forth in any one 
of SEQ ID NOS:<400>25 to <400>34. 

A fourth aspect of the present invention is directed to an isolated or recombinant starch 
synthase polypeptide, protein or enzyme, preferably substantially free of conspecific 
30 or non-specific proteins, which comprises an amino acid sequence selected from the 



f p 
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following: 

(i) a wheat starch synthase II (wSSII) polypeptide, protein or enzyme or 
functional subunit thereof which comprises an amino acid sequence which is at 
least about 85% identical overall to an amino acid sequence set forth In any one 

5 of SEQ ID NOS: <400>2, <400>4, or <400>6; 

(ii) a wheat starch synthase III (wSSIII) polypeptide, protein or enzyme or 
functional subunit thereof which comprises an amino acid sequence which is at 
least about 85% identical overall to an amino acid sequence set forth in any one 
of SEQ ID NOS: <400>8 or <400>10;and 

10 (iii) a wheat starch synthase polypeptide, protein or enzyme or functional 

subunit thereof which comprises a conserved amino acid sequence having at 
least 25% Identity to an amino acid sequence selected from the group 
consisting of: 

(a) KVGGLGDWTS; 

15 (b) GHTVEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 

(f) NGQWLLGSA; 

20 (g)AGSDFIIVPSIFEPCGLTQLVAMRYGS; and 

(h)TGGLVDTV 

and wherein said wheat starch synthase polypeptide further comprises an 
amino acid sequence having at least about 85% identity overall to an amino 
acid sequence set forth in any one of SEQ ID NOS: <400>2, <400>4, <400>6, 
25 <400>8or<400>10. 

A further aspect of the invention provides a method of assaying for the presence or 
absence of a starch synthase isoenzyme or the copy number of a gene encoding same 
30 in a plant, comprising contacting a biological sample derived from said plant with an 
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isolated nucleic acid molecule derived from any one of SEQ ID NOS: <400>1 , <400>3, 
<400>5, <400>7 or <400>9 or SEQ ID NOS: <400>1 1 to <400>16, or any one of SEQ 
ID NOS:<400>25 to <400>34, or a complementary nucleotide sequence thereto for a 
time and under conditions sufficient for hybridisation to occur and then detecting said 
5 hybridisation using a detection means. 

The detection means according to this aspect of the invention is any nucleic acid 
based hybridisation or amplification reaction. 

10 A further aspect of the present invention utilises the above-mentioned assay method 
in the breeding and/or selection of plants which express or do not express particular 
starch sythase isoenzymes or alternatively, which express a particular starch synthase 
isoenzyme at a particular level in one or more plant tissues. This aspect clearly 
extends to the selection of transformed plant material which contains one or more of 

15 the isolated nucleic acid molecules of the present invention. 

A further aspect of the present invention provides a method of modifying the starch 
content and/or starch composition of one or more tissues or organs of a plant, 
comprising expressing therein a sense molecule, antisense molecule, ribozyme 

20 molecule, co-suppression molecule, or gene-targeting molecule having at least about 
85% nucleotide sequence identity to any one of any one of SEQ ID NOS: <400>1, 
<400>3, <400>5. <400>7 or <400>9 or SEQ ID NOS: <400>11 to <400>16, or a 
complementary nucleotide sequence thereto for a time and under conditions sufficient 
for the enzyme activity of one or more starch synthase isoenzymes to be modified. 

25 This aspect of the invention clearly extends to the introduction of the sense molecule, 
antisense molecule, ribozyme molecule, co-suppression molecule, or gene-targeting 
molecule to isolated plant cells, tissues or organs or organelles by cell fusion or 
transgenic means and the regeneration of intact plants therefrom. 

30 A further aspect of the present invention provides an isolated promoter that Is operable 
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in the endosperm of a monocotyledonous plant cell, tissue or organ, and preferably in 
the endosperm of a monocotyledonous plant cell, tissue or organ. For example, the 
HMG promoter from wheat, or the maize zein gene promoter are particularly preferred, 
as is the promoter derived from a starch synthase gene of the present invention, such 
5 as a promoter that is linked in vivo to any one of SEQ ID NOS: <400>1, <400>3, 
<400>5, <400>7 or <400>9 or any one of SEQ ID NOS: <400>11 to <400>16. or a 
complementary nucleotide sequence thereto. 

A still further aspect of the present invention contemplates a transgenic plant 
10 comprising an introduced sense molecule, antisense molecule, ribozyme molecule, co- 
suppression molecule, or gene-targeting molecule having at least about 85% 
nucleotide sequence identity to any one of any one of SEQ ID NOS: <400>1, <400>3, 
<400>5, <400>7 or <400>9 or SEQ ID NOS: <400>11 to <400>16, or a 
complementary nucleotide sequence thereto or a gentic construct comprising same,^ 
15 and to plant propagules, cells, tissues, organs or plant parts derived from said 
transgenic plant that also carry the introduced molecule(s). 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a copy of a photographic representation showing the distribution of wheat 
20 endosperm starch synthases between the starch granule and soluble fractions. Lane 
1 , SDS-PAGE of wheat endosperm starch granule proteins revealed by silver staining; 
lanes 2-7, immunoblot of wheat endosperm soluble phase and starch granule proteins 
separated by SDS-PAGE from various developmental stages and probed with an anti- 
(wheat wSSII peptide) monoclonal antibody. Lanes 2-4 contain proteins from the 
25 soluble fraction of wheat endosperm at 15 days post anthesis (Lane 2); 20 days post 
anthesis (Lane 3); and at 25 days post anthesis (Lane 4). Lanes 5-7 contain proteins 
from the starch granule of wheat endosperm at 15 days post anthesis (Lane 5); 20 
days post anthesis (Lane 6); and at 25 days post anthesis (Lane 7). 

30 Figure 2 is a copy of a schematic representation comparing the nucleotide sequences 
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of cDNA clones designated wSSIIA, wSSIIB and wSSIID, encoding the starch 
synthase II polypeptides from wheat, using the PILEUP programme of Devereaux et 
al. (1984). 

5 Figure 3 is a copy of a schematic representation comparing the deduced amino acid 
sequences of starch synthase II from wheat (wSSIIA, wSSIIB and wSSIID), maize 
(maizeSSIIa and maizeSSIIb; Ham et al., 1998). pea (peaSSlI; Dry et al ., 1992) and 
potato (potatoSSII; van der Leij et al., 1991). Identical amino acid residues among 
each of these sequences are indicated below the sequences with "*". The alignments 

10 of maize SSIIa with maize SSIIb, and pea SSII and potato SSII are essentially as 
described in Harn et al. (1998) and Edwards etal. (1995). All sequences are aligned 
to position the transit peptide cleavage site below the arrow (II) between residues 59 
and 60 of the wSSilA sequence. The wSSIIpl sequence, the sequence of SGP-B1 
(peptides), and of eight conserved regions are annotated and underlined. 

15 

Figure 4 is a copy of a photographic representation of a northern blot showing the 
expression of wheat wSSII mRNA in wheat plants. Total RNAs were isolated from 
leaves pre-anthesis florets and endosperm of the wheat cultivar "Gabo", grown under 
a photoperiod comprising 16 hours daylength, and at 18 C during theday, and at 13 C 
20 during the night cycle, and probed with the wSSIIp2 DNA fragment. The source of 
each RNA is indicated at the top of the Figure as follows: Lane 1 , leaf; Lane 2, pre- 
anthesis florets; Lanes 3-1 1 , endosperm at: 4 days post-anthesis (Lane 3); 6 days 
post-anthesis (Lane 4); 8 days post-anthesis (Lane 5); 10 days post-anthesis (Lane 
6); 12 days post-anthesis (Lane 7); 15 days post-anthesis (Lane 8); 18 days post- 
25 anthesis (Lane 9); 21 days post-anthesis (Lane 10); and 25 days post-anthesis (Lane 
11). 

Figure 5 is a copy of a photographic representation showing the localization of wheat 
starch synthase II genes on the wheat genome by PGR, using the primers ssllc, sslld 
30 and sslle in the amplification reaction. The nullisomic-tetrasomic genomic DNA of 
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wheat cv. Chinese Spring was used as template DNA. Lane D, Triticum tauschii] Lane 
AB. Accession line N7DT7B having no 7D chromosome and four copies of the 78 
chromosome; Lane AD, Accession line N7BT7A having no 7B chromosome and four 
copies of the 7A chromosome; Lane BD, Accession line N7AT7B having no 7A 
5 chromosome and four copies of the 7B chromosome; Lane ABD, wheat cv. Chinese 
Spring. PCR products derived from each cDNA clone are labelled. The results indicate 
that the cDNA clones, wSSIIB, wSSIIA and wSSIID are derived from the B-, A- and D- 
genomes of wheat, respectively. 

10 Figure 6 is a copy of a photographic representation showing the purification of a wheat 
SSII genomic clone from the T. tauschii war, Strangulata (Accession No. CPI 110799) 
genomic library. A genomic clone, designated wSSII-8, was identified by hybridisation 
with the wSSIIp2 probe and purified through successive rounds of selection and 
hybridisation. 

15 

Figure 7 is a copy of a photographic representation showing a Southern blot of 
BamHI-digested genomic clone DNAs identified in a primary screening using the 
wSSIIp2 probe, following hybridisation with wSSIIp4 probe DNA which is derived from 
the 5'-end of the wSSIIA cDNA clone. Lane 8 contains DNA derived from genomic 
20 clone wSSII-8 (see Figure 6). Hybridisation of clone wSSII-8 to thw wSSIIp4 probe 
suggests that this genomic clone contains the promoter region of the wSSII gene. 

Figure 8 is a schematic representation comparing the deduced amino acid Sequences 
of the maize, potato and wheat SSIII polypeptides. 

25 

Figure 9 is a copy of a photographic representation showing the purification of a wheat 
SSIII genomic clone from a T. tauschii genomic library. A plaque was identified by 
hybridisation with a PCR-derived from the wSSIII.B3 gene (a) and purified through 
successive rounds of selection and hybridisation. The hybridisation of plaques from a 
30 third round of plaque purification is shown in (b). 
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Figure 10 is a copy of a photographic representation showing the expression of wheat 
wSSIII mRNA in wheat. Total RNAs were isolated from the endospemi of the wheat 
cultivars Wyuna (Panel a) and Gabo (Panel b) leaves pre-anthesis florets and 
endosperm of the wheat cultivar "Gabo", grown under a photoperiod comprising 16 

o o 

5 hours daylength, and at 18 C during the day cycle, and at 1 3 C during the night cycle, 
and probed with the wSSIIIpl DNA fragment derived from wSSIII.B3 cDNA. The 
source of each RNA is Indicated at the top of the Figure as follows: Lane 1, endospemn 
at: 4 days post-anthesis; Lane 2, endosperm at 6 days post-anthesis; Lane 4, 
endospemi at 8 days post-anthesis; Lane 4, endosperm at 10 days post-anthesis; 

10 Lane 5, endosperm at 12 days post-anthesis; Lane 6, endosperm at 15 days post- 
anthesis; Lane 7, endosperm at 18 days post-anthesis; Lane 8, endosperm at 21 days 
post-anthesis; Lane 9, endosperm at 25 days post-anthesis; and Lane 10, endosperm 
at 31 days post-anthesis (Panel a only). In panel (c), L refers to leaf RNA, and P refers 
to RNA from pre-anthesis florets derived from the cultivar Gabo. 

15 

Figure 11 is a schematic representation showing the relationships between the 
primary amino acid sequences of starch synthases (SS) and glycogen synthase of E. 
coli (GS). The dendrogram was generated by the program PILEUP (Devereaux et ai, 
1984). The amino acid sequences used for the analysis are those of the wheat SSIIA, 

20 wheat SSIIB, wheat SSIID, and wheat SSIII polypeptides of the present invention 
compared to the deduced amino acid sequences of wheat GBSS (Clark et ai, 1991), 
wheat SSI (Li etai, 1999), rice GBSS (Okagaki, 1992), rice SSI (Baba etai, 1993), 
maize GBSS (Kloesgen etai, 1986), maize SSI (Knight etai, 1998), maize SSIIa and 
maize SSIIb (Harn etai, 1998), maize SSIII (Gao etai, 1998), pea GBSS (Dry etai, 

25 1992). pea SSII (Dry etai, 1992), potato GBSS (van der LeIj et ai, 1991), potato SSI 
(Genbank accession number: STSTASYNT), potato SSII (Edwards etai, 1995), potato 
SSIII (Abel etai, 1996), and E. co// glycogen synthase (GS) (Kumar ef a/., 1986). Five 
groups of enzymes included in the alignment are granule-bound starch synthase 
(GBSS), starch synthase-l (SSI), starcg synthase-ll (SSII). starch synthase-lll (SSIII) 

30 and glycogen synthase (GS). 
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Figure 12 is a schematic representation showing the position of conserved regions 
within cereal starch synthase genes. Comparisons of cereal starch synthases were 
made based on their deduced amino acid sequences and 8 conserved regions 
identified. Conserved regions are shown in bold and transit peptides (where defined) 
5 in grey. The sequences included in the alignment are the wheat SSII-A1 and wheat 
SSII! polypeptides of the present invention; wheat GBSS (Ainsworth et aL, 1993); 
wheat SSI (Li etal., 1999); maize SSIIa (Harn etaL, 1998); and maize dull-1(Gao et 
ai, 1998). 

10 Figure 13 is a schematic representation showing the position of conserved amino acid 
seqeunces within four wheat starch synthase proteins. The eight highly-conserved 
regions between the wheat starch synthase polypeptides are underlined and annotated 
at the top of each group of amino acid sequences. The sequences included in the 
alignment are the wheat SSII-A1 and wheat SSIII polypeptides of the present 

15 invention; wheat GBSS (wGBSS; Yan ef a/.. 1999); wheat SSI (wSS1; Li ef a/., 1999); 
wheat SSII (wSS2; SEQ ID NO:<400>4); and wheat SSIII (wSS3; SEQ ID 
NO:<400>8). 

Figure 14 is a copy of a schematic representation of a gene map showing the 
20 alignment of fragments 1 to 6 of the genomic SSIII gene (lower line) with the 
corresponding SSIII cDNA clone (uper line). Raised regions in the genomic clone 
fragments (lower line) represent protein-encoding regions of the gene. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

25 One aspect of the present invention provides an isolated nucleic acid molecule which 
comprises a sequence of nucleotides which encodes, or is complementary to a nucleic 
acid molecule which encodes a wheat starch synthase polypeptide, protein or enzyme 
molecule or a functional subunit thereof selected from the following: 

(i) a wheat starch synthase II (wSSII) polypeptide, protein or enzyme or 

30 functional subunit thereof which comprises an amino acid sequence set forth 
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in any one of SEQ ID NOS: <400>2, <400>4, or <400>6; and 
(ii) a wheat starch synthase III (wSSIII) polypeptide, protein or enzyme or 
functional subunit thereof which comprises an amino acid sequence set forth 
in any one of SEQ ID NOS: <400>8 or <400>10. 

5 

Alternatively or in addition, the isolated nucleic acid molecule of the present invention 
encodes a wheat starch synthase II (wSSII) polypeptide, protein or enzyme or 
functional subunit thereof and comprises a nucleotide sequence set forth in any one 
of SEQ ID NOS: <400>1 . <400>3. or <400>5. 

10 

Alternatively or in addition, the isolated nucleic acid molecule of the present invention 
encodes a wheat starch synthase III (wSSIII) polypeptide, protein or enzyme or 
functional subunit thereof and comprises a nucleotide sequence set forth in any one 
of SEQ ID NOS: <400>7 or <400>9. 

15 

As used herein, the term "starch synthase" shall be taken to refer to any enzymatically- 
active peptide, polypeptide, oligopeptide, polypeptide, protein or enzyme molecule that 
is at least capable of transferring a glucosyl moiety from ADP-glucose to an a-1 ,4- 
glucan molecule, or a peptide, polypeptide, oligopeptide or polypeptide fragment of 
20 such an enzymatically-active molecule. 

The term "wheat starch synthase" refers to a starch synthase derived from hexaploid 
wheat or barley or a progenitor species, or a relative thereto such as the diploid 
Triticum tauschii or other diploid, tetraploid, aneuploid, polyploid, nullisomic, or a 
25 wheat/barley addition line, amongst others, the only requirement that the genomic DNA 
is at least about 80% identical to the genome of a wheat plant as determined by 
standard DNA melting curve analyses. 

The term "starch synthase H" or "wSSII" or similar term shall be taken to refer to a 
30 starch synthase as hereinbefore defined that is detectable in the starch granule of a 
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plant seed endosperm and possesses one or more properties selected from the group 
consisting of: 

(!) it is immunologically cross-reactive with the wheat starch granule 
proteins designated Sgp-B1 and/or Sgp-D1 and/or Sgp-A1, having estimated 
5 molecular weights of about 85 kDa to about 115 kDa; 

(ii) it is encoded by one of a homeologous set of genes localised on wheat 
chromosomes 7B or 7A or 7D; 

(iii) it is encoded by a nucleotide sequence that comprises at least about 15 
nucleotides in length derived from any one or more of SEQ ID NOS: <400>1, 

10 <400>3, or <400>5 or a complementary nucleotide sequence thereto; 

(iv) it is encoded by a nucleotide sequence that is at least about 85% 
identical to one or more of the nucleotide sequences set forth in SEQ ID NOS: 
<400>1, <400>3, or <400>5 or a complementary nucleotide sequence thereto; 

(v) it comprises an amino acid sequence having at least about 85% identity 
15 to one or more of SEQ ID NOS:<400>2 or <400>4 or <400>6; 

(vi) it comprises at least about 5 contiguous amino acids, preferably at least 
about 10 contiguous amino acids, more preferably at least about 15 contiguous 
amino acids, even more preferably at least about 20 contiguous amino acids 
and still even more preferably at least about 25-50 contiguous amino acids of 

20 the amino acid sequences set forth in SEQ ID NOS:<400>2 or <400>4 or 

<400>6;and 

(vii) it which comprises a conserved amino acid sequence having at least 
25% identity to an amino acid sequence selected from the group consisting of: 

(a) KVGGLGDWTS; 
25 (b) GHTVEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 

(f) NGQWLLGSA; 

30 (g)AGSDFIIVPSIFEPCGLTQLVAMRYGS; and 
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(h)TGGLVDTV 
in addition to any one or more of (i) to (vi). 

The term "starcli synthase 111" or "wSSIII" or similar term shall be taken to refer to a 
5 starch synthase as hereinbefore defined that possesses one or more properties 
selected from the group consisting of: 

(i) it is encoded by a nucleotide sequence that comprises at least about 15 
nucleotides in length derived from any one or more of SEQ ID NOS:<400>7 or 
<400>9 or any one or more of SEQ ID NOS: <400>11 to <400>16 or a 

10 complementary nucleotide sequence thereto; 

(ii) it Is encoded by a nucleotide sequence that is at least about 85% 
identical to one or more of the nucleotide sequences set forth in SEQ ID NOS: 
<400>7 or <400>9 or any one or more of SEQ ID NOS: <400>11 to <400>16 
or a complementary nucleotide sequence thereto; and 

15 (iii) it comprises an amino acid sequence having at least about 85% identity 

to one or more of SEQ ID NOS:<400>8 or <400>10; 

(iv) it comprises at least about 5 contiguous amino acids, preferably at least 
about 1 0 contiguous amino acids, more preferably at least about 1 5 contiguous 
amino acids, even more preferably at least about 20 contiguous amino acids 

20 and still even more preferably at least about 25-50 contiguous amino acids of 

the amino acid sequences set forth in SEQ ID NOS:<400>8 or <400>10;and 

(v) which comprises a conserved amino acid sequence having at least 25% 
Identity to an amino acid sequence selected from the group consisting of: 

(a) KVGGLGDWTS; 
25 (b) GH7VEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 

(f) NGQWLLGSA; 

30 (g)AGSDFIIVPSIFEPCGLTQLVAMRYGS; and 
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(h)TGGLVDTV 
in addition to any one or more of (1) to (iv). 

In a more preferred embodiment, tlie WSSli or WSSIII polypeptide encoded by the 
5 nucleic acid molecule of the present invention will comprise a substantial contiguous 
region of any one of SEQ ID NOS: <400>2, <400>4, <400>6, <400>8 or <400>10 or 
<400>17 sufficient to possess the biological activity of a starch synthase polypeptide. 

For the purposes of nomenclature, the nucleotide sequence set forth in SEQ ID NO: 

10 <400>1 relates to the cDNA molecule encoding the WSSII (i.e. Sgp-B1) polypeptide 
of wheat. The amino acid sequence of the corresponding polypeptide is set forth 
herein as SEQ ID NO:<400>2. The nucleotide sequence set forth in SEQ ID NO: 
<400>3 relates to the cDNA molecule encoding the WSSII (i.e. Sgp-A1) polypeptide 
of wheat. The amino acid sequence of the corresponding polypeptide is set forth 

15 herein as SEQ ID NO:<400>4. The nucleotide sequence set forth in SEQ ID NO: 
<400>5 relates to the cDNA molecule encoding the WSSII (i.e. Sgp-D1) polypeptide 
of wheat. The amino acid sequence of the corresponding polypeptide is set forth 
herein as SEQ ID NO:<400>6. The nucleotide sequences set forth in SEQ ID NOs: 
<400>7 and <400>9 relate, respectively, to full-length and partial cDNA molecules 

20 encoding the WSSIII polypeptide of wheat. The amino acid sequences of the 
corresponding polypeptides are set forth herein as SEQ ID NOs:<400>8 and <400>10, 
respectivelyThe nucleotide sequences set forth in SEQ ID NOs: <400>1 1 to <400>16 
relates to fragments of the genomic gene encoding the WSSIII polypeptide of wheat, 
significant protein-encoding regions of which are described by reference to Table 3 

25 and Figure 14. 

Preferably, the isolated nucleic acid molecule of the present invention comprises a 
sequence of nucleotides which encodes, or is complementary to a nucleic acid 
molecule which encodes a wheat starch synthase III polypeptide, protein or enzyme 
30 molecule or a functional subunit thereof which comprises an amino acid sequence 
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which is at least about 85% identical overall to an amino acid sequence set forth in any 
one of SEQ ID NOS: <400>8 or <400>10 and more preferably, which additionally 
comprises which comprises one or more conserved amino acid sequences selected 
from the group consisting of: 
5 (a) KVGGLGDWTS; 

(b) GHTVEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 
10 (f) NGQWLLGSA; 

(g) AGSDFIIVPSIFEPCGLTQLVAMRYGS; and 

(h) TGGLVDTV . 

15 The present invention clearly extends to homologues, analogues and derivatives of the 
wheat starch synthase II and III genes exemplified by the nucleotide sequences set 
forth herein as SEQ ID NOs:<400>1. <400>3. <400>5, <400>7. <400>9 and <400>11 
to <400>16. 

20 Preferred starch synthase genes may be derived from a naturally-occurring starch 
synthase gene by standard recombinant techniques. Generally, a starch synthase 
gene may be subjected to mutagenesis to produce single or multiple nucleotide 
substitutions, deletions and/or additions. Nucleotide insertional derivatives of the 
starch synthase gene of the present invention include 5' and 3' terminal fusions as 

25 well as intra-sequence insertions of single or multiple nucleotides. Insertional 
nucleotide sequence variants are those in which one or more nucleotides are 
introduced into a predetermined site in the nucleotide sequence although random 
insertion is also possible with suitable screening of the resulting product. Deletional 
variants are characterised by the removal of one or more nucleotides from the 

30 sequence. Substitutional nucleotide variants are those in which at least one nucleotide 
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in the sequence has been removed and a different nucleotide inserted in its place. 
Such a substitution may be "silent" in that the substitution does not change the amino 
acid defined by the codon. Alternatively, substituents are designed to alter one amino 
acid for another similar acting amino acid, or amino acid of like charge, polarity, or 
5 hydrophobicity. 

For the present purpose, "homologues" of a nucleotide sequence shall be taken to 
refer to an isolated nucleic acid molecule which is substantially the same as the nucleic 
acid molecule of the present invention or its complementary nucleotide sequence, 
10 notwithstanding the occurrence within said sequence, of one or more nucleotide 
substitutions, insertions, deletions, or rearrangements. 

"Analogues" of a nucleotide sequence set forth herein shall be taken to refer to an 
isolated nucleic acid molecule which is substantially the same as a nucleic acid 
15 molecule of the present invention or its complementary nucleotide sequence, 
notwithstanding the occurrence of any non-nucleotide constituents not normally 
present in said isolated nucleic acid molecule, for example carbohydrates, 
radiochemicals including radionucleotides, reporter molecules such as, but not limited 
to DIG, alkaline phosphatase or horseradish peroxidase, amongst others. 

20 

"Derivatives" of a nucleotide sequence set forth herein shall be taken to refer to any 
isolated nucleic acid molecule which contains significant sequence similarity to said 
sequence or a part thereof. Generally, the nucleotide sequence of the present 
invention may be subjected to mutagenesis to produce single or multiple nucleotide 

25 substitutions, deletions and/or insertions. Nucleotide insertional derivatives of the 
nucleotide sequence of the present invention include 5' and 3' terminal fusions as well 
as intra-sequence insertions of single or multiple nucleotides or nucleotide analogues. 
Insertional nucleotide sequence variants are those in which one or more nucleotides 
or nucleotide analogues are introduced into a predetermined site in the nucleotide 

30 sequence of said sequence, although random insertion is also possible with suitable 
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screening of the resulting product being performed. Deletional variants are 
characterised by the removal of one or more nucleotides from the nucleotide 
sequence. Substitutional nucleotide variants are those in which at least one nucleotide 
in the sequence has been removed and a different nucleotide or nucleotide analogue 
5 inserted in its place. 

The present invention extends to the isolated nucleic acid molecule when integrated 
Into the genome of a cell as an addition to the endogenous cellular complement of 
starch synthase genes, irrespective of whether or not the Introduced nucleotide 

10 sequence is translatable or non-translatable to produce a polypeptide. The present 
invention clearly contemplates the introduction of additional copies of starch synthase 
genes into plants, particularly wheat plants, in the antisense orientation to reduce the 
expression of particular wheat starch synthase genes. As will be known to those skilled 
in the art. such antisense genes are non-translatable, notwithstanding that they can 

15 be expressed to produce antisense mRNA molecules. 

The said integrated nucleic acid molecule may, or may not, contain promoter 
sequences to regulate expression of the subject genetic sequence. 

20 Accordingly, the present invention clearly encompasses preferred homologues. 
analogues and derivatives that comprise a sequence of nucleotides which encodes, 
or is complementary to a nucleic acid molecule which encodes a wheat starch 
synthase polypeptide, protein or enzyme molecule or a functional subunit thereof 
selected from the following: 
25 (i) a wheat starch synthase II (wSSII) polypeptide, protein or enzyme or 

functional subunit thereof which comprises an amino acid sequence which is at 
least about 85% identical overall to an amino acid sequence set forth in any one 
of SEQ ID NOS: <400>2. <400>4. or <400>6; 

(ii) a wheat starch synthase III (wSSIII) polypeptide, protein or enzyme or 
30 functional subunit thereof which comprises an amino acid sequence which is at 
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least about 85% identical overall to an amino acid sequence set forth in any one 
of SEQ ID NOS: <400>8 or <400>10;and 

(ill) a wheat starch synthase polypeptide, protein or enzyme or functional 
subunit thereof which comprises a conserved amino acid sequence having at 
5 least 25% identity to an amino acid sequence selected from the group 

consisting of: 

(a) KVGGLGDWTS; 

(b) GHTVEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 
10 (d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 

(f) NGQWLLGSA; 

(g) AGSDFIIVPSIFEPCGLTQLVAMRYGS; and 

(h) TGGLVDTV 

15 and wherein said wheat starch synthase polypeptide further comprises an 

amino acid sequence having at least about 85% identity overall to an amino 
acid sequence set forth in any one of SEQ ID NOS: <400>2. <400>4. <400>6, 
<400>8 or <400>10. 

20 Preferably, the isolated nucleic acid molecule encodes a starch synthase polypeptide, 
protein or enzyme that comprises two, more preferably three, more preferably four, 
more preferably five, more preferably six, more preferably seven and even more 
preferably eight of the conserved amino acid motifs listed supra. Even more preferably, 
the said amino acid motifs are located in a relative configuration such as that shown 

25 for the wheat SSII or wheat SSIII polypeptides listed in Figure 13 herein. 

In a preferred embodiment, the isolated nucleic acid molecule encodes a starch 
synthase polypeptide, protein or enzyme having at least about 90% amino acid 
sequence identity to any one of SEQ ID NOS:<400>2, <400>4, <400>6, <400>8 or 
30 <400>10. more preferably having at least about 95% or about 97% or about 99% 
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identlty to any one of said amino acid sequences. 

In an alternative embodiment, the present invention provides an isolated nucleic acid 
molecule which encodes a wheat starch synthase polypeptide, protein or enzyme 
5 molecule or a functional subunit thereof, wherein said nucleic acid molecule comprises 
a nucleotide sequence having at least about 85% nucleotide sequence identity to any 
one of SEQ ID NOS: <400>1, <400>3, <400>5. <400>7 or <400>9 or SEQ ID NOS: 
<400>11 to <400>16, or a degenerate nucleotide sequence thereto or a 
complementary nucleotide sequence thereto. 

10 

By "degenerate nucleotide sequence" is meant a nucleotide sequence that encodes 
a substantially identical amino acid sequence as a stated nucleotide sequence. 

In a prefenred embodiment, the isolated nucleic acid molecule comprises the 
15 nucleotide sequence set forth in any one of SEQ ID NOS: <400>1 . <400>3, <400>5, 
<400>7 or <400>9 or SEQ ID NOS: <400>11 to <400>16, or is at least about 90% 
Identical, more preferably at least about 95% or 97% or 99% identical to all or a 
protein-encoding part thereof. 

20 In an alternative embodiment, preferred homologues, analogues and derivatives of the 
nucleic acid molecule of the present invention encodes a wheat starch synthase 
polypeptide, protein or enzyme molecule or a functional subunit thereof and comprises 
a nucleotide sequence that is capable of hybridising under at least moderate 
stringency hybridisation conditions to at least about 30 contiguous nucleotides derived 

25 from any one of SEQ ID NOS: <400>1, <400>3. <400>5, <400>7 or <400>9 or SEQ 
ID NOS: <400>11 to <400>16, or a complementary nucleotide sequence thereto. 

For the purposes of defining the level of stringency, a low stringency is defined herein 
as being a hybridisation and/or a wash, carried out in 6xSSC buffer, 0.1% (w/v) SDS 
30 at 28°C. Generally, the stringency is increased by reducing the concentration of SSC 
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buffer, and/or increasing the concentration of SDS and/or increasing the temperature 
of the hybridisation and/or wash. A moderate stringency comprises a hybridisation 
and/or a wash carried out in 0.2 x SSC-2 x SSC buffer, 0.1% (w/v) SDS at 42 °C to 
65°C, while a high stringency comprises a hybridisation and/or a wash carried out in 
5 0.1xSSC-0.2 X SSC buffer, 0.1% (w/v) SDS at a temperature of at least 55^C. 
Conditions for hybridisations and washes are well understood by one normally skilled 
in the art. For the purposes of further clarification only, reference to the parameters 
affecting hybridisation between nucleic acid molecules is found in pages 2.10.8 to 
2.10.16. of Ausubel etaL (1987), which is herein incorporated by reference. 

10 

Those skilled in the art will be aware of procedures for the isolation of further wheat 
starch synthase genes to those specifically described herein or homologues, 
analogues or derivatives of said genes, for example further cDNA sequences and 
genomic gene equivalents, when provided with one or more of the nucleotide 

15 sequences set forth in SEQ ID NOs: <400>1, <400>3, <400>5, <400>7, <400>9, or 
<400>11 to <400>16. In particular, amplifications and/or hybridisations may be 
performed using one or more nucleic acid primers or hybridisation probes comprising 
at least 10 contiguous nucleotides and preferably at least about 20 contiguous 
nucleotides or 50 contiguous nucleotides derived from the nucleotide sequences set 

20 forth herein, to isolate cDNA clones, mRNA molecules, genomic clones from a 
genomic library (in particular genomic clones containing the entire 5' upstream region 
of the gene including the promoter sequence, and the entire coding region and 3'- 
untranslated sequences), and/or synthetic oligonucleotide molecules, amongst others. 
The present invention clearly extends to such related sequences. 

25 

Accordingly, a second aspect of the present invention provides a method of isolating 
a nucleic acid molecule that encodes a starch synthase polypeptide, protein or enzyme 
having at least about 85% amino acid sequence identity to any one SEQ ID 
NOS:<400>2, <400>4, <400>6, <400>8 or <400>10 and/or which comprises a 
30 conserved amino acid sequence having at least 25% Identity to an amino acid 
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sequence selected from the group consisting of: 

(a) KVGGLGDWTS; 

(b) GHTVEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 
5 (d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 

(f) NGQWLLGSA; 

(g) AGSDFIIVPSIFEPCGLTQLVAIVIRYGS; and 

(h) TGGLVDTV. 
10 said method comprising: 

(i) hybridising a probe or primer comprising at least about 15 contiguous 
nucleotides in length derived from any one of SEQ ID NOS: <400>1, <400>3, 
<400>5, <400>7 or <400>9 or SEQ ID NOS: <400>11 to <400>16, or a 
complementary nucleotide sequence thereto to single-stranded or double- 

15 stranded mRNA, cDNA or genomic DNA; and 

(ii) detecting the hybridised mRNA, cDNA or genomic DNA using a detecting 
means. 

Preferably, the detecting means is a reporter molecule covalently attached to the probe 
20 or primer molecule or alternatively, a polymerase chain reaction format. 

An alternative method contemplated in the present invention involves hybridising two 
nucleic acid "primer molecules" to a nucleic acid "template molecule" which comprises 
a related starch synthase gene or related starch synthase genetic sequence or a 

25 functional part thereof, wherein the first of said primers comprises contiguous 
nucleotides derived from any one or more of SEQ ID NOS: <400>1 , <400>3. <400>5, 
<400>7 or <400>9 or SEQ ID NOS: <400>11 to <400>16 and the second of said 
primers comprises contiguous nucleotides complementary to any one or more of SEQ 
ID NOS: <400>1. <400>3. <400>5, <400>7 or <400>9 or SEQ ID NOS: <400>11 to 

30 <400>16. Specific nucleic acid molecule copies of the template molecule are amplified 
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enzymatically in a polymerase chain reaction, a technique that is well known to one 
skilled in the art. 

In a preferred embodiment, each nucleic acid primer molecule is at least 10 
5 nucleotides in length, more preferably at least 20 nucleotides in length, even more 
preferably at least 30 nucleotides in length, still more preferably at least 40 nucleotides 
in length and even still more preferably at least 50 nucleotides in length. 

Furthermore, the nucleic acid primer molecules consists of a combination of any of the 
10 nucleotides adenine, cytidine, guanine, thymidine, or inosine, or functional analogues 
or derivatives thereof which are at least capable of being incorporated into a 
polynucleotide molecule without having an inhibitory effect on the hybridisation of said 
primer to the template molecule in the environment in which it is used. 

15 Furthermore, one or both of the nucleic acid primer molecules may be contained in an 
aqueous mixture of other nucleic acid primer molecules, for example a mixture of 
degenerate primer sequences which vary from each other by one or more nucleotide 
substitutions or deletions. Alternatively, one or both of the nucleic acid primer 
molecules may be in a substantially pure form. 

20 

The nucleic acid template molecule may be in a recombinant form, in a virus particle, 
bacteriophage particle, yeast cell, animal cell, or a plant cell. Preferably, the nucleic 
acid template molecule is derived from a plant cell, tissue or organ, in particular a cell, 
tissue or organ derived from a wheat or bariey plant or a progenitor species, or a 
25 relative thereto such as the diploid Triticum tauschii or other diploid, tetraploid, 
aneuploid, polyploid, nullisomic, or a wheat/barley addition line, amongst others. 

Those skilled in the art will be aware that there are many known variations of the basic 
polymerase chain reaction procedure, which may be employed to isolate a related 
30 starch synthase gene or related starch synthase genetic sequence when provided with 
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the nucleotide sequences set forth herein. Such variations are discussed, for example, 
in McPherson et al (1991). The present invention extends to the use of all such 
variations in the isolation of related starch synthase genes or related starch synthase 
genetic sequences using the nucleotide sequences embodied by the present invention. 

5 

As exemplified herein, the present inventors have isolated several wheat starch 
synthase genes using both hybridisation and polymerase chain reaction approaches, 
employing novel probes and primer sequences to do so. 

10 Accordingly, a third aspect of the invention provides an isolated probe or primer 
comprising at least about 15 contiguous nucleotides in length derived from any one of 
SEQ ID NOS: <400>1 , <400>3, <400>5, <400>7 or <400>9 or SEQ ID NOS: <400>1 1 
to <400>16, or a complementary nucleotide sequence thereto. 

15 Preferably, the probe or primer comprises a nucleotide sequence set forth in any one 
of SEQ ID NOS:<400>25 to <400>34. 

The isolated nucleic acid molecule of the present invention may be introduced into and 
expressed in any cell, for example a plant cell, fungal cell, insect cell, animal cell, yeast 
20 cell or bacterial cell. Those skilled in the art will be aware of any modifications which 
are required to the codon usage or promoter sequences or other regulatory 
sequences, in order for expression to occur in such cells. 

A further aspect of the invention provides a method of assaying for the presence or 
25 absence of a starch synthase isoenzyme or the copy number of a gene encoding same 
in a plant, comprising contacting a biological sample derived from said plant with an 
isolated nucleic acid molecule derived from any one of SEQ ID NOS: <400>1 , <400>3, 
<400>5, <400>7 or <400>9 or any one of SEQ ID NOS: <400>1 1 to <400>16, or any 
one of SEQ ID NOS:<400>25 to <400>34, or a complementary nucleotide sequence 
30 thereto for a time and under conditions sufficient for hybridisation to occur and then 
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detecting said hybridisation using a detection means. 

The detection means according to this aspect of the invention is any nucleic acid 
based hybridisation or amplification reaction. 

5 

The hexaploid nature of wheat prevents the straightforward identification of starch 
synthase allelic variants by hybridisation using the complete starch synthase-encoding 
sequence, because the similarities between the various alleles generally results in 
significant cross-hybridisation. Accordingly, sequence-specific hybridisation probes are 

-10 required to distinguish between the various alleles. Similarly, wherein PGR is used to 
amplify specific allelelic variants of a starch synthase gene, one or more sequence- 
specific amplification primers are generally required. As will be apparent from the 
amino acid sequence comparisons provided herein, such as in Figures 3 and 13, non- 
conserved regions of particular wheat starch synthase polypeptides are particularly 

15 useful for the design of probes and primers that are capable of distinguishing between 
one or more starch synthase polypeptide isoenzyme or allelic variant. The present 
invention clearly contemplates the design of such probes and primers based upon the 
sequence comparisons provided herein. 

20 In the performance of this embodiment of the present invention, the present inventors 
particularly contemplate the identification of wheat starch synthase null alleles or 
alternatively, mutations wherein specific amino acids are inserted or deleted or 
substituted , compared to one or more of the wheat SSII or SSIII alleles disclosed 
herein. Such null alleles and other allelic variants are readily identifiable using PGR 

25 screeening which employs amplification primers based upon the nucleotide and amino 
acid sequences disclosed herein for SSII and/or SSIII. Once identified, the various 
mutations can be stacked or pyramided into one or more new wheat lines, such as by 
introgression and/or standard plant breeding and/or recombinant approaches (eg. 
transformation, transfection, etc) thereby producing a novel germplasm which exhibits 

30 altered starch properties compared to existing lines. DNA markers based upon the 
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nucleotide and amino acid sequences disclosed herein for SSII and/or SSIII can be 
ennployed to nnonitor the stacking of genes into the new lines and to correlate the 
presence of particular genes with starch phenotypes of said lines. 

5 In this regard, a significant advantage conferred by the present invention Is the design 
of new DNA markers that reveal polymorphisms such as, for example, length 
polymorphisms, restriction site polymorphisms, and single nucleotide polymorphisms, 
amongst others, between wheat starch synthases and, in particular, between wheat 
GBSS and/or SSI and/or SSII and/or SSIII, or between allelic variants of one or more 
10 of said starch synthases, that can be used to identify the three genomes of hexaploid 
wheats (i.e., the A. B and D genomes). 

Preferably, such DNA markers are derived from the intron region of a starch synthase 
gene disclosed herein, more preferably the wheat SSII and/or the wheat SSIII gene. 

15 Those skilled in the art will be aware that such regions generally have a higher degree 
of variation than in the protein-encoding regions and, as a consequence, are 
particularly useful in identifying specific allelic variants of a particular gene, such as 
allelic variants contained in any one of the three wheat genomes, or alternatively or in 
addition, for the purpose of distinguishing between wheat GBSS, SSI. SSII or SSIII 

20 genes. 

A further approach contemplated by the present inventors is the design of unique 
isoenzyme-specific and/or allele-specific peptides based upon the amino acid 
sequence disclosed herein as SEQ ID NOS:<400>2 and/or <400>4 and/or <400>6 

25 and/or <400>8 and/or <400>10. which peptides are then used to produce polyclonal 
or monoclonal antibodies by conventional means. Alternatively, the genes encoding 
these polypeptides or unique peptide regions thereof can be introduced in an 
expressible format into an appropriate prokaryotic or eukaroyotic expression system, 
where they can be expressed to produce the isoenzyme-specific and/or allele-specific 

30 peptides for antibody production. Such antibodies may also be used as markers for the 
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purpose of both identifying parental lines and germplasms and monitoring the stacking 
of genes in new lines, using conventional immunoassays such as, for example, ELISA 
and western blotting. 

5 A further aspect of the present invention utilises the above-mentioned nucleic acid 
based assay method in the breeding and/or selection of plants which express or do not 
express particular starch synthase isoenzymes or alternatively, which express a 
particular starch synthase isoenzyme at a particular level in one or more plant tissues. 
This aspect clearly extends to the selection of transformed plant material which 
10 contains one or more of the isolated nucleic acid molecules of the present invention. 

Yet another aspect of the present invention provides for the expression of the nucleic 
acid molecule of the present invention in a suitable host (e.g. a prokaryote or 
eukaryote) to produce full length or non-full length recombinant starch synthase gene 
15 products. 

Hereinafter the term "starch synthase gene product" shall be taken to refer to a 
recombinant product of a starch synthase gene of the present invention. 

20 Preferably, the recombinant starch synthase gene product comprises an amino acid 
sequence having the catalytic activity of a starch synthase polypeptide or a functional 
mutant, derivative part, fragment, or analogue thereof. 

In a particularly preferred embodiment of the invention, the recombinant starch 

25 synthase gene product is selected from the following: 

(i) a wheat starch synthase II (wSSII) polypeptide, protein or enzyme or 
functional subunit thereof which comprises an amino acid sequence which is at 
least about 85% identical overall to an amino acid sequence set forth in any one 
of SEQ ID NOS: <400>2, <400>4. or <400>6; 

30 (ii) a wheat starch synthase III (wSSfll) polypeptide, protein or enzyme or 
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functional subunit thereof which comprises an amino acid sequence which Is at 
least about 85% Identical overall to an amino acid sequence set forth in any one 
of SEQ ID NOS: <400>8 or <400>10;and 

(iii) a wheat starch synthase polypeptide, protein or enzyme or functional 
5 subunit thereof which comprises a conserved amino acid sequence having at 

least 25% identity to an amino acid sequence selected from the group 
consisting of: 

(a) KVGGLGDWTS; 

(b) GHTVEVILPKY; 

10 (c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 

(f) NGQWLLGSA; 

(g) AGSDFIIVPSIFEPCGLTQLVAMRYGS; and 
15 (h)TGGLVDTV 

and which is at least about 85% identical overall to an amino add sequence set 
forth in any one of SEQ ID NOS: <400>2, <400>4, <400>6, <400>8 or 
<400>10. 

20 Accordingly, the present invention clearly extends to homologues, analogues and 
derivatives of the amino acid sequences set forth herein as SEQ ID NOS: <400>2, 
<400>4, <400>6, <400>8 and <400>10. 

In the present context, "homologues" of an amino acid sequence refer to those 
25 polypeptides, enzymes or proteins which have a similar catalytic activity to the amino 
acid sequences described herein, notwithstanding any amino acid substitutions, 
additions or deletions thereto. A horriologue may be isolated or derived firom the same 
or another plant species as the species from which the polypeptides of the invention 
are derived. 

30 
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"Analogues" encompass polypeptides of the invention notwithstanding the occurrence 
of any non-naturally occurring amino acid analogues therein. 

"Derivatives" include modified peptides in which ligands are attached to one or more 
5 of the amino acid residues contained therein, such as carbohydrates, enzymes, 
proteins, polypeptides or reporter molecules such as radionuclides or fluorescent 
compounds. Glycosylated, fluorescent, acylated or alkylated forms of the subject 
peptides are particularly contemplated by the present invention. Additionally, 
derivatives of an amino acid sequence described herein which comprises fragments 
10 or parts of the subject amino acid sequences are within the scope of the invention, as 
are homopolymers or heteropolymers comprising two or more copies of the subject 
polypeptides. Procedures for derivatizing peptides are well-known in the art. 

Substitutions encompass amino acid alterations in which an amino acid is replaced 
15 with a different naturally-occurring or a non-conventional amino acid residue. Such 
substitutions may be classified as "conservative", in which an amino acid residue 
contained in a starch synthase gene product is replaced with another naturally- 
occurring amino acid of similar character, for example Gly<-^AIa, Val^lle^Leu, 
Asp^Glu, Lys<->Arg, Asn^GIn or Phe^Trp^Tyr. 

20 

Substitutions encompassed by the present invention may also be "non-conservative", 
in which an amino acid residue which is present in a starch synthase gene product 
described herein is substituted with an amino acid with different properties, such as a 
naturally-occurring amino acid from a different group (eg. substituted a charged or 
25 hydrophobic amino acid with alanine), or alternatively, in which a naturally-occurring 
amino acid is substituted with a non-conventional amino acid. 

Non-conventional amino acids encompassed by the invention include, but are not 
limited to those listed in Table 2. 
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Amino acid substitutions are typically of single residues, but may be of multiple 
residues, either clustered or dispersed. 

Amino acid deletions will usually be of the order of about 1-10 amino acid residues, 
5 while insertions may be of any length. Deletions and insertions may be made to the 
N-terminus, the C4erminus or be internal deletions or insertions. Generally, insertions 
within the amino acid sequence will be smaller than amino- or carboxy-terminal fusions 
and of the order of 1-4 amino acid residues. 

10 A homologue, analogue or derivative of a starch synthase gene product as referred to 
herein may readily be made using peptide synthetic techniques well-known in the art, 
such as solid phase peptide synthesis and the like, or by recombinant DNA 
manipulations. Techniques for making substituent mutations at pre-determined sites 
using recombinant DNA technology, for example by Ml 3 mutagenesis, are also well- 

15 known. The manipulation of nucleic acid molecules to produce variant peptides, 
polypeptides or proteins which manifest as substitutions, insertions or deletions are 
well-known in the art. 

The starch synthase gene products described herein may be derivatized further by the 
20 inclusion or attachment thereto of a protective group which prevents, inhibits or slows 
proteolytic or cellular degradative processes. Such derivatization may be useful where 
the half-life of the subject polypeptide is required to be extended, for example to 
increase the amount of starch produced in the endosperm or alternatively, to increase 
the amount of protein produced in a bacterial or eukaryotic expression system. 
25 Examples of chemical groups suitable for this purpose include, but are not limited to, 
any of the non-conventional amino acid residues listed in Table 2, in particular a D- 
stereoisomer or a methylated form of a naturally-occurring amino acid listed in Table 
1 . Additional chemical groups which are useful for this purpose are selected from the 
list comprising aryl or heterocyclic N-acyl substituents, polyalkylene oxide moieties, 
30 desulphatohirudin muteins, alpha-muteins, alpha-aminophosphonic acids, water- 
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soluble polymer groups such as polyethylene glycol attached to sugar residues using 
hydrazone or oxime groups, benzodiazepine dione derivatives, glycosyl groups such 
as beta-glycosylamine or a derivative thereof, isocyanate conjugated to a polyol 
functional group or polyoxyethylene polyol capped with dilsocyanate, amongst others. 
5 Similarly, a starch synthase gene product or a homologue, analogue or derivative 
thereof may be cross-linked or fused to itself or to a protease inhibitor peptide, to 
reduce susceptibility of said molecule to proteolysis. 

In a particularly preferred embodiment, the percentage similarity to in any one of SEQ 
10 ID NOS: <400>2. <400>4, <400>6, <400>8 or <400>10 is at least about 90%, more 
preferably at least about 95%, even more preferably at least about 97% and even 
more preferably at least about 98%, or about 99% or 100%. 

In a related embodiment, the present invention provides a "sequencably pure" form^of 
15 the amino acid sequence described herein. "Sequencably pure" is hereinbefore 
described as substantially homogeneous to facilitate amino acid determination. 

In a further related embodiment, the present invention provides a "substantially 
homogeneous" form of the subject amino acid sequence, wherein the term 

20 "substantially homogeneous" is hereinbefore defined as being in a form suitable for 
interaction with an immunologically interactive molecule. Preferably, the polypeptide 
is at least 20% homogeneous, more preferably at least 50% homogeneous, still more 
preferably at least 75% homogeneous and yet still more preferably at least about 95- 
100% homogenous, in terms of activity per microgram of total protein in the protein 

25 preparation. 

To produce the recombinant polypeptide of the present invention, the coding region 
of a starch synthase gene described herein or a functional homologue, analogue or 
derivative thereof is placed operably in connection with a promoter sequence in the 
30 sense orientation, such that a starch synthase gene product is capable of being 
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expressed under the control of said promoter sequence. 

In the present context, the term "in operable connection with" means that expression 
of the isolated nucleotide sequence is under the control of the promoter sequence with 
5 which it is connected, regardless of the relative physical distance of the sequences 
from each other or their relative orientation with respect to each other. 

Reference herein to a "promoter" is to be taken in its broadest context and includes the 
transcriptional regulatory sequences of a classical genomic gene, including the TATA 

10 box which is required for accurate transcription initiation, with or without a CCAAT box 
sequence and additional regulatory elements (i.e. upstream activating sequences, 
enhancers and silencers) which alter gene expression in response to developmental 
and/or external stimuli, or in a tissue-specific manner. A promoter is usually, but not 
necessarily, positioned upstream or 5', of a structural gene, the expression of which 

15 it regulates. Furthermore, the regulatory elements comprising a promoter are usually 
positioned within 2 kb of the start site of transcription of the gene. 

In the present context, the term "promoter" is also used to describe a synthetic or 
fusion molecule, or derivative which confers, activates or enhances expression of a 

20 structural gene or other nucleic acid molecule, particularly in a plant cell and more 
preferably in a wheat plant or other monocotyledonous plant cell, tissue or organ. 
Preferred promoters may contain additional copies of one or more specific regulatory 
elements, to further enhance expression and/or to alter the spatial expression and/or 
temporal expression. For example, regulatory elements which confer copper 

25 inducibility may be placed adjacent to a heterologous promoter sequence, thereby 
conferring copper inducibility on the expression of said molecule. 

Those skilled in the art will be aware that in order to obtain optimum expression of the 
starch synthase gene of the present invention, it is necessary to position said gene in 
30 an appropriate configuration such that expression is controlled by the promoter 
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sequence. Promoters are generally positioned 5' (upstream) to the genes that they 
control. In the construction of heterologous promoter/structural gene combinations it 
is generally preferred to position the promoter at a distance from the gene transcription 
start site that is approximately the same as the distance between that promoter and 
5 the gene it controls in its natural setting, i.e., the gene from which the promoter is 
derived. As is known in the art, some variation in this distance can be accommodated 
without loss of promoter function. Similarly, the preferred positioning of a regulatory 
sequence element with respect to a heterologous gene to be placed under its control 
is defined by the positioning of the element in its natural setting, i.e., the genes from 
10 which it is derived. Again, as is known in the art, some variation in this distance can 
also occur. 

Examples of promoters suitable for expressing the starch synthase gene of the present 
invention include viral, fungal, bacterial, animal and plant derived promoters capable 

15 of functioning in prokaryotic or eukaryotic cells. Prefen^ed promoters are those capable 
of regulating the expression of the subject starch synthase genes in plants cells, fungal 
cells, insect cells, yeast cells, animal cells or bacterial cells, amongst others. 
Particularly preferred promoters are capable of regulating expression of the subject 
nucleic acid molecules in monocotyledonous plant cells. The promoter may regulate 

20 the expression of the said molecule constitutively, or differentially with respect to the 
tissue in which expression occurs or, with respect to the developmental stage at which 
expression occurs, or in response to external stimuli such as physiological stresses, 
or plant pathogens, or metal ions, amongst others. 

25 Accordingly, strong constitutive promoters are particularly preferred for the purposes 
of the present invention. 

Examples of preferred promoters include the bacteriophage T7 promoter, 
bacteriophage T3 promoter, SP6 promoter, lac operator-promoter, tac promoter, SV40 
30 late promoter, SV40 early promoter, RSV-LTR promoter, CMV IE promoter, CaMV 35S 
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promoter, SCSV promoter, SCBV promoter and the like. 

Particularly preferred promoters operable in plant cells include, for example the CaMV 
35S promoter, and the SCBV promoter. Those skilled in the art will readily be aware 
5 of additional promoter sequences other than those specifically described. 

In a particularly preferred embodiment, the promoter may be derived from a genomic 
starch synthase gene. Preferably, the promoter sequence comprises nucleotide 
sequences that are linked in vivo to nucleotide sequences set forth in any one of SEQ 
10 ID NOs: <400>1, <400>3, <400>5, <400>7, <400>9. or any one of SEQ ID NOs: 
<400>11 to <400>16. By "linked in vivo" means that the promoter is present in its 
native state in the genome of a wheat plant where it controls expression of the starch 
synthase gene of the present invention. 

15 Conveniently, genetic constructs are employed to facilitate expression of a starch 
synthase genetic sequence of the present invention or a functional derivative, part, 
homologue, or analogue thereof. To produce a genetic construct, the starch synthase 
gene of the invention is inserted into a suitable vector or episome molecule, such as 
a bacteriophage vector, viral vector or a plasmid, cosmid or artificial chromosome 

20 vector which is capable of being maintained and/or replicated and/or expressed in the 
host cell, tissue or organ into which it is subsequently introduced. The said genetic 
construct comprises the subject nucleic acid molecule placed operably under the 
control of a promoter sequence and optionally, a terminator sequence. 

25 The term "terminator" refers to a DNA sequence at the end of a transcriptional unit 
which signals termination of transcription. Terminators are 3 '-non-translated DNA 
sequences containing a polyadenylation signal, which facilitates the addition of 
polyadenylate sequences to the 3'-end of a primary transcript. Terminators active in 
bacteria, yeasts, animal cells and plant cells are known and described in the literature. 

30 They may be isolated from bacteria, fungi, viruses, animals and/or plants. 
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Examples of terminators particularly suitable for use in expressing the nucleic acid 
molecule of the present invention in plant cells include the nopaline synthase (NOS) 
gene terminator of Agrobacterium tumefaciens, the terminator of the Cauliflower 
mosaic virus (CaMV) 35S gene, and the zein gene terminator from Zea mays. 

5 

Genetic constructs will generally further comprise one or more origins of replication 
and/or selectable marker gene sequences. 

The origin of replication can be functional in a bacterial cell and comprise, for example, 
10 the pUC or the C0IEI origin. Alternatively, the origin of replication is operable in a 
eukaryotic cell, tissue and more preferably comprises the 2 micron (2/2m) origin of 
replication or the SV40 origin of replication. 

As used herein, the term "selectable marker gene" includes any gene which confers 
15 a phenotype on a cell in which it is expressed to facilitate the identification and/or 
selection of cells which are transfected or transformed with a genetic construct of the 
invention or a derivative thereof. 

Suitable selectable marker genes contemplated herein include the ampicillin-resistance 
20 gene (Amp'), tetracycline-resistance gene (Tc'). bacterial kanamycin-resistance gene 
(Kan'), is the zeocin resistance gene (Zeocin is a drug of bleomycin family which is 
trademark of InVitrogen Corporation), the AURI-C gene which confers resistance to the 
antibiotic aureobasidin A, phosphinothricin-resistance gene, neomycin 
phosphotransferase gene (npfll), hygromycin-resistance gene, P-glucuronidase (GUS) 
25 gene, chloramphenicol acetyltransferase (CAT) gene, green fluorescent protein- 
encoding gene or the luciferase gene, amongst others. Those skilled in the art will be 
aware of other selectable marker genes useful in the performance of the present 
invention and the subject invention is not limited by the nature of the selectable marker 
gene. 

30 
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Usually, an origin of replication or a selectable marker gene suitable for use in bacteria 
is physically-separated from those genetic sequences contained in the genetic 
construct which are intended to be expressed or transferred to a eukaryotic cell, or 
integrated into the genome of a eukaryotic cell. 

5 

Standard methods can be used to introduce genetic constructs into a cell, tissue or 
organ for the purposes of modulating gene expression. Particularly preferred methods 
suited to the introduction of synthetic genes and genetic constructs comprising same 
to eukaryotic cells include liposome-mediated transfection or transformation, 
10 transformation of cells with attenuated virus particles or bacterial cells and standard 
procedures for the transformation of plant and animal cells, tissues, organs or 
organisms. Any standard means may be used for their introduction including cell 
mating, transformation or transfection procedures known to those skilled in the art or 
described by Ausubel et al. (1992). 

15 

In a further embodiment of the present invention, the starch synthase genes of the 
present invention and genetic constructs comprising same are adapted for integration 
into the genome of a cell in which it is expressed. Those skilled in the art will be aware 
that, in order to achieve integration of a genetic sequence or genetic construct into the 
20 genome of a host cell, certain additional genetic sequences may be required. In the 
case of plants, left and right border sequences from the T-DNA of the Agrobacterium 
tumefaciens Ti plasmid will generally be required. 

The invention further contemplates increased starch and/or modified starch 
25 composition in transgenic plants expressing the nucleic acid molecule of the invention 
in the sense orientation such that the activity of one or more starch synthase 
isoenzymes is increased therein. By increasing the level of one or more starch 
synthase isoenzymes, the deposition of starch in the amyloplast or chloroplast is 
increased and/or a modified starch granule structure is produced and/or starch 
30 composition is modified and/or the amylose/amylopectin ratio is altered in the plant. 
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Wherein it is desired to increase the synthesis of a particular starch synthase 
isoenzyme in a plant cell, the coding region of a starch synthase gene is placed 
operably behind a promoter, in the sense orientation, such that said starch synthase 
is expressed under the control of said promoter sequence. In a preferred embodiment, 
5 the starch synthase genetic sequence is a starch synthase genomic sequence, cDNA 
molecule or protein-coding sequence. 

Wherein it is desirable to reduce the level of a particular starch synthase isoenzyme 
in a plant cell, the nucleic acid molecule of the present invention can be expressed in 
10 the antisense orientation, as an antisense molecule or a ribozyme molecule, under the 
control of a suitable promoter. 

Alternatively, the nucleic acid molecule of the present invention may also be expressed 
in the sense orientation, in the form of a co-suppression molecule, to reduce the level 

15 of a particular starch synthase isoenzyme in a plant cell. As will be known to those 
skilled in the art, co-suppression molecules that comprise inverted repeat sequences 
of a target nucleic acid molecule provide optimum efficiency at reducing expression of 
said target nucleic acid molecule and, as a consequence, the present invention clearly 
contemplates the use of inverted repeat sequences of any one or more of the starch 

20 synthase genetic sequences exemplified herein, or inverted repeat sequences of a 
homologue, analogue or derivative of said starch synthase genetic sequences, to 
reduce the level of a starch synthase isoenzyme in a plant. 

The expression of an antisense, ribozyme or co-suppression molecule comprising a 
25 starch synthase gene in a cell such as a plant cell, fungal cell, insect cell, animal cell, 
yeast cell or bacterial cell, may also increase the availability of carbon as a precursor 
for a secondary metabolite other than starch (e.g. sucrose or cellulose). By targeting 
the endogenous starch synthase gene, expression is diminished, reduced or otherwise 
lowered to a level that results in reduced deposition of starch in the amyloplast or 
30 chloroplast and/or leads to modified starch granule structure and/or composition 
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and/or altered amylose/amylopectin ratio. 

Accordingly, a further aspect of the present invention provides a method of nnodifying 
the starch content and/or starch composition of one or more tissues or organs of a 
5 plant, comprising expressing therein a sense molecule, antisense molecule, ribozyme 
molecule, co-suppression molecule, or gene-targeting molecule having at least about 
85% nucleotide sequence identity to any one of any one of SEQ ID NOS: <400>1, 
<400>3, <400>5. <400>7 or <400>9 or any one of SEQ ID NOS: <400>1 1 to <400>16, 
or a complementary nucleotide sequence thereto for a time and under conditions 
10 sufficient for the enzyme activity of one or more starch synthase isoenzymes to be 
modified. This aspect of the invention clearly extends to the introduction of the sense 
molecule, antisense molecule, ribozyme molecule, co-suppression molecule, or gene- 
targeting molecule to isolated plant cells, tissues or organs or organelles by cell fusion 
or transgenic means and the regeneration of intact plants therefrom. 

15 

Co-suppression is the reduction in expression of an endogenous gene that occurs 
when one or more copies of said gene, or one or more copies of a substantially similar 
gene are introduced into the cell, preferably in the forni of an inverted repeat structure. 

20 The present inventors have discovered that the genetic sequences disclosed herein 
are capable of being used to modify the level of starch when expressed, particularly 
when expressed in plants cells. Accordingly, the present invention clearly extends to 
the modification of starch biosynthesis in plants, in particular wheat or barley plants or 
a progenitor plant species, or a relative thereto such as the diploid Triticum tauschii 

25 or other diploid, tetraploid, aneuploid, polyploid, nullisomic. or a wheat/barley addition 
line, amongst others. 

In particular, the present invention contemplates decreased starch production and/or 
modified starch composition in transgenic plants expressing the nucleic acid molecule 
30 of the invention in the antisense orientation or alteratively, expressing a ribozyme or 
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co-suppression molecule comprising the nucleic acid sequence of the invention such 
that the activity of one or more starch synthase isoenzymes is decreased therein. 

5 

In the context of the present invention, an antisense molecule is an RNA molecule 
which is transcribed from the complementary strand of a nuclear gene to that which is 
normally transcribed to produce a "sense" mRNA molecule capable of being translated 
into a starch synthase polypeptide. The antisense molecule is therefore 

10 complementary to the mRNA transcribed from a sense starch synthase gene or a part 
thereof. Although not limiting the mode of action of the antisense molecules of the 
present invention to any specific mechanism, the antisense RNA molecule possesses 
the capacity to form a double-stranded mRNA by base pairing with the sense mRNA, 
which may prevent translation of the sense mRNA and subsequent synthesis of a 

15 polypeptide gene product. 

Ribozymes are synthetic RNA molecules which comprise a hybridising region 
complementary to two regions, each of at least 5 contiguous nucleotide bases in the 
target sense mRNA. In addition, ribozymes possess highly specific endoribonuclease 
20 activity, which autocatalytically cleaves the target sense mRNA. A complete 
description of the function of ribozymes is presented by Haseloff and Gerlach (1988) 
and contained in International Patent Application No. WO89/05852. 

The present invention extends to ribozyme which target a sense mRNA encoding a 
25 native starch synthase gene product, thereby hybridising to said sense mRNA and 
cleaving it, such that it is no longer capable of being translated to synthesise a 
functional polypeptide product. 

According to this embodiment, the present invention provides a ribozyme or antisense 
30 molecule comprising at least 5 contiguous nucleotide bases derived from any one of 
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SEQ ID NOS: <400>1, <400>3, <400>5, <400>7 or <400>9 or any one of SEQ ID 
NOS: <400>11 to <400>16, or a complementary nucleotide sequence thereto or a 
homologue, analogue or derivative thereof, wherein said antisense or ribozyme 
molecule is able to form a hydrogen-bonded complex with a sense mRNA encoding 
5 a starch synthase gene product to reduce translation thereof. 

In a preferred embodiment, the antisense or ribozyme molecule comprises at least 10 
to 20 contiguous nucleotides derived from any one of SEQ ID NOS: <400>1, <400>3, 
<400>5, <400>7 or <400>9 or any one of SEQ ID NOS: <400>1 1 to <400>16, or a 

10 complementary nucleotide sequence thereto or a homologue, analogue or derivative 
thereof. Although the preferred antisense and/or ribozyme molecules hybridise to at 
least about 10 to 20 nucleotides of the target molecule, the present invention extends 
to molecules capable of hybridising to at least about 50-100 nucleotide bases in length, 
or a molecule capable of hybridising to a full-length or substantially full-length mRNA 

.15 encoded by a starch synthase gene. 

Those skilled in the art will be aware of the necessary conditions, if any, for selecting 
or preparing the antisense or ribozyme molecules of the invention. 

20 It is understood in the art that certain modifications, including nucleotide substitutions 
amongst others, may be made to the antisense and/or ribozyme molecules of the 
present invention, without destroying the efficacy of said molecules in inhibiting the 
expression of a starch synthase gene. It is therefore within the scope of the present 
invention to include any nucleotide sequence variants, homologues, analogues, or 

25 fragments of the said gene encoding same, the only requirement being that said 
nucleotide sequence variant, when transcribed, produces an antisense and/or 
ribozyme molecule which is capable of hybridising to a sense mRNA molecule which 
encodes a starch synthase gene product. 

30 Gene targeting is the replacement of an endogenous gene sequence within a cell by 
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a related DNA sequence to which it hybridises, thereby altering the form and/or 
function of the endogenous gene and the subsequent phenotype of the cell. According 
to this embodiment, at least a part of the DNA sequence defined by any one of SEQ 
ID NOS: <400>1, <400>3, <400>5, <400>7 or <400>9 or any one of SEQ ID NOS: 
5 <400>11 to <400>16 may be introduced into target cells containing an endogenous 
gene that encodes a particular starch synthase isoenzyme, thereby replacing said 
endogenous gene. According to this embodiment, the polypeptide product of the gene 
targetting molecule generally encodes a starch synthase isoenzyme that possesses 
different catalytic activity to the polypeptide product of the endogenous gene, 
10 producing in turn modified starch content and/or composition in the target cell. 

The present invention extends to genetic constructs designed to facilitate expression 
of a sense molecule, an antisense molecule, ribozyme molecule, co-suppression 
molecule, or gene targeting molecule of the present invention. The requirements for 
15 expressing such molecules are similar to those for expressing a recombinant 
polypeptide as described supra. 

The present invention further extends to the production and use of starches produced 
by the application of the novel genes described herein. 

20 

Starch hydrolysates or undegraded starches are both useful in industry and, as a 
consequence, the present invention is useful in applications relating to the use of both 
starch hydrolysates and undegraded starches. By "starch hydrolysates" is meant the 
glucose and glucan components that are obtainable by the enzymatic or chemical 
25 degradation of starch in chemical modifications and processes, such as fermentation. 

For example, starch produced by plants expressing the sense, antisense, co- 
suppression, gene-targetting or ribozyme molecules of the present invention may 
exhibit modified viscosities and/or gelling properties of its glues when compared to 
30 starch derived from wild-type plants. Native starches produced by the performance of 
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the inventive method are useful as an additive in the following: (i) foodstuffs, for the 
purpose of increasing the viscosity or gelling properties of food; (ii) in non-foodstuffs, 
such as an adjuvant or additive in the paper and cardboard industries, for retention or 
as a size filler, or as a solidifying substance or for dehydration, or film coating, 
5 amongst others; (iii) in the adhesive industry as pure starch glue, as an additive to 
synthetic resins and polymer dispersions, or as an extenders for synthetic adhesives; 
(Iv) in the textile and textile care industries to strengthen woven products and reduce 
burring or to thicken dye pastes; (v) in the building industry, such as a binding agent 
in the production of gypsum plaster boards, or for the deceleration of the sizing 

10 process; (vi) in ground stabilization or for the temporary protection of ground particles 
against water in artificial earth shifting; (vii) as a wetting agent in plant protectants and 
fertilizers; (viii) as a binding agent in drugs, pharmaceuticals and medicated foodstuff 
such as vitamins, etc; (ix) as an additive in coal and briquettes; (xi) as a flocculent in 
the processing of coal ore and slurries; (xii) as a binding agent in casting processes 

IS to increase flow resistance and improve binding strength; and (xiii) to improve the 
technical and optical quality of rubber and plastic products. Additional applications are 
not excluded. 

A further aspect of the present invention provides an isolated promoter that is operable 
20 in the endosperm of a monocotyledonous plant cell, tissue or organ, and preferably in 
the endosperm of a monocotyledonous plant cell, tissue or organ. According to this 
embodiment, it is preferred that the promoter is derived from a starch synthase gene 
of the present invention, such as a promoter that is linked in vivo to any one of SEQ 
ID NOS: <400>1, <400>3, <400>5, <400>7 or <400>9 or any one of SEQ ID NOS: 
25 <400>1 1 to <400>16, or a complementary nucleotide sequence thereto. 

In a particulariy preferred embodiment, the promoter comprises a nucleotide sequence 
derivable from the 5'-upstream region of SEQ ID NO:<400>11 or a complementary 
nucleotide sequence thereto, an more preferably comprises nucleotides 1 to about 287 
30 of SEQ ID NO:<400>1 1 or nucleotides 1 to about 287 of SEQ ID NO:<400>1 1 or a 
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complementary nucleotide sequence thereto. The present invention clearly extends 
to promoter sequences that comprise further nucleotide sequences in the region 
upstream of the stated nucleotide sequence that are linked in vivo to said nucleotide 
sequence in the wheat genome. 

5 

In a related embodiment, the promoter sequence of the present invention will further 
comprise an exon sequence derived from a starch synthase gene, for example 
nucleotides 260 to 385 of SEQ ID NO:<400>11 or a complementary nucleotide 
sequence thereto. Those skilled in the art will be aware that the inclusion of such 
10 nucleotide sequences may increase the expression of a heterologous structural gene, 
the expression of which is controlled thereby. 

The present invention further extends to the expression of any structural gene operably 
under the control of the starch synthase promoter sequence exemplified herein opa 
15 functional homologue, analogue or derivative of said promoter sequence. 

As with other embodiments described herein for expression in cells, a genetic 
construct may be employed to effect said expression and the present invention clearly 
extends to said genetic constructs. 

20 

The polypeptide encoded by the structural gene component may be a reporter 
molecule which is encoded by a gene such as the bacterial p-g!ucuronidase gene or 
chloramphenicol acetyltransferase gene or alternatively, the firefly luciferase gene. 
Alternatively, wherein it is desirable to alter carbon partitioning within the endosperm, 
25 the polypeptide may be an enzyme of the starch sucrose biosynthetic pathways. 
Preferably, the promoter sequence is used to regulate the expression of one or more 
of the starch synthase genes of the present invention or a sense, antisense, ribozyme, 
co-suppression or gene-targetting molecule comprising or derived from same. 

30 Recombinant DNA molecules carrying the aforesaid nucleic acid molecule of the 
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present invention or a sense, antisense, ribozyme, gene-targetting or co-suppression 
molecule and/or genetic construct comprising same, may be introduced into plant 
tissue, thereby producing a "transgenic plant", by various techniques known to those 
skilled in the art. The technique used for a given plant species or specific type of plant 
5 tissue depends on the known successful techniques. Means for introducing 
recombinant DNA into plant tissue include, but are not limited to, transformation 
(Paszkowski etaL, 1984), electroporation (Fromm etaL, 1985), or microinjection of the 
DNA (Crossway et aL, 1986), or T-DNA-mediated transfer from Agrobacterium to the 
plant tissue. Representative T-DNA vector systems are described in the following 

10 references: An ef a/.(1985); Herrera-Estrella et aL (1983a,b); Herrera-Estrella et aL 
(1985). Once introduced into the plant tissue, the expression of the introduced gene 
may be assayed in a transient expression system, or it may be determined after 
selection for stable integration within the plant genome. Techniques are known for the 
in vitro culture of plant tissue, and in a number of cases, for regeneration into whole 

15 plants. Procedures for transferring the introduced gene from the originally transformed 
plant into commercially useful cultivars are known to those skilled in the art. 

in general, plants are regenerated from transformed plant cells or tissues or organs on 
hormone-containing media and the regenerated plants may take a variety of forms, 

20 such as chimeras of transformed cells and non-transformed cells; clonal transformants 
(e.g., ail cells transformed to contain the expression cassette); grafts of transformed 
and untransformed tissues (e.g., a transformed root stock grafted to an untransformed 
scion in citrus species). Transformed plants may be propagated by a variety of means, 
such as by clonal propagation or classical breeding techniques. For example, a first 

25 generation (or T1) transformed plants may be selfed to give homozygous second 
generation (or T2) transformed plants, and the T2 plants further propagated through 
classical breeding techniques. 

Accordingly, a still further aspect of the present invention contemplates a transgenic 
30 plant comprising an introduced sense molecule, antisense molecule, ribozyme 
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molecule, co-suppression molecule, or gene-targeting molecule having at least about 
85% nucleotide sequence identity to any one of any one of SEQ ID NOS: <400>1, 
<400>3, <400>5, <400>7 or <400>9 or any one of SEQ ID NOS: <400>1 1 to <400>16. 
or a complementary nucleotide sequence thereto or a genetic construct comprising 
5 same. 

The present invention further extends to those plant parts, propagules and progeny of 
said transgenic plant or derived therefrom, the only requirement being that said 
propagules and progeny also carry the introduced nucleic acid molecule(s). 

10 

The present invention is further described by reference to the following non-limiting 
examples. 

15 EXAMPLE 1 

Plant material 

Genetic stocks of hexapioid bread wheat Triticum aestivum cv. Chinese Spring with 
various chromosome additions and deletions were kindly supplied by Dr E. Lagudah 
(CSIRO Plant Industry, Canberra) and derived from stocks described in Sears and 
20 Miller (1985). The hexapioid {Triticum aestivum) wheats cv Gabo and cv Wyuna were 
grown in controlled growth cabinet conditions (18 "^C day and 13P C night, with a 
photoperiod of 16 h). Wheat leaves and florets prior to anthesis, and endosperm were 
collected over the grain filling period, immediately frozen in liquid nitrogen and stored 
at -80X until required. 

25 

EXAMPLE 2 

Gel Electrophoresis, Antibodies and Immunoblotting 

Monoclonal antibodies against the Sgp-1 proteins, and their use in the immunoblotting 
of SDS-PAGE gels have been described previously (Rahman ef a/., 1995). 
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EXAMPLE 3 
Preparation of total RNA from wheat 

Total RNA was isolated from the leaf, floret and endosperm tissues of wheat 
essentially as described by Higgins et ai (1976) or Rahman et al. (1998). RNA was 
5 quantified by UV absorption and by separation in 1.4% (w/v) agarose-formaldehyde 
gels which were then visualised under UV light after staining with ethidium bromide. 

EXAMPLE 4 
Construction and screening of cDNA libraries 

10 A first cDNA library, an expression cDNA library of wheat endosperm, was constructed 
from mRNA isolated from wheat cv Chinese Spring. RNA from 5, 7. 9, 11 and 13 days 
after anthesis was pooled and random primers were used for the first strand of cDNA 
synthesis. Monoclonal antibodies against 100 -105 kDa proteins in wheat starch 
granules (Rahman et al., 1995) were used for immunoscreening of the expression 

15 cDNA library. 

A second cDNA library was constructed from the endosperm mRNA of the hexaploid 
Triticum aestivum cultivar Wyuna, 8-12 days after anthesis, as described by Rahman 
et ai (1997). This library was screened with a 85-bp cDNA fragment, wSSIIpl , which 
20 was obtained by immunoscreening of the expression cDNA library as described above. 
The wSSIIpl probe corresponded to nucleotide positions 988 to 1072 of wSSIIB (SEQ 
ID NO:<400>1) at the hybridisation conditions as described eariier (Rahman et ai, 
1998). 

25 A third cDNA library was constructed from RNA from the endosperm of the hexaploid 
Triticum aestivum cultivar Rosella as described by Rahman et ai (1997). This library 
was screened with a 347-bp cDNA fragment, wSSIIIpl for the first screening and a 
478-bp cDNA fragment wSSIIIp3 for the second screening ( PLEASE ADVISE- 
nucleotides 2469 to 2947 of SEQ ID NO:<400>7) using the hybridisation conditions 

30 described herein. 
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EXAMPLE 5 

Construction and screening of Triticum tauschii genomic library 

The genomic library used in this study, prepared from Triticum tauschii, var 
strangulata, (Accession Number CP! 110799), has been described in Rahman ef a/., 
5 (1997). Of all the accessions of 7. tauschii surveyed. DNA marker analysis suggests 
that the genome of CPI 110799 is the most closely related to the D genome of 
hexaploid wheat (Lagudah etal., 1991). 

Hybridisations were carried out in 25% formamide, 6 x SSC, 0.1% SDS at 42*^0 for 16 
10 hours, then filters were washed 3 times using 2 x SSC containing 0.1% SDS at 65 °C 
for 1 hour per wash. 

For the isolation of a genomic wSSII clone, the probe comprised the PCR-derived DNA 
fragment wSSIIp2 and positive-hybridising plaques were digested using the restriction 
15 enzyme BamHI, separated on a 1% agarose gel, transferred to nitrocellulose 
membrane and hybridised to probe wSSIIp4 comprising nucleotides 1 to 367 of the 
wSSIIA cDNA clone, using the conditions described by Rahman et ai (1997). 

For the isolation of a genomic wSSIII clone, plaques hybridising to the PCR-derived 
20 DNA fragment wSSIIIpl from clone wSSIILB3 (i.e. nucleotides 3620 to 3966 of SEQ 
ID NO:<400>7) were selected and re-screened until plaque-purified. 



EXAMPLE 6 

25 DNA sequencing and analysis 

DNA sequencing was performed using the automated ABI system with dye terminators 
as described by the manufacturers. DNA sequences were analysed using the GCG 
suite of programs (Devereaux et al., 1984). 



30 
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EXAMPLE 7 
DNA and RNA analysis 
DNA was isolated and analysed as previously described (Maniatis et ai, 1982; 
Rahman et ai, 1998). Approximately 20 /^g of DNA was digested with restriction 
5 enzymes SamHI, Dra\ and EcoRI, separated on a 1% agarose gel and transferred to 
reinforced nitrocellulose membranes (BioRad) and hybridised with ^^P-labelled DNA 
probe, either wSSI I Ipl, corresponding to nucleotides 3620 to 3966 of the wheat SSIII 
gene, or alternatively, with the entire wSSlI cDNA clone. DNA fragment probes were 
labelled with the Rapid Multiprime DNA Probe Labelling Kit (Promega). 

10 

The hybridisation and wash conditions were performed as described in Rahman et al. 
(1997). For RNA analysis, 10 A<g of total RNA was separated in a 1.4% agarose- 
formaldehyde gel and transferred to a Hybond N+ membrane (Amersham), and 
hybridised with cDNA probe at 42°C as previously described by Khandjian et ai, 
15 (1987) or Rahman et ai, (1998). After washing for 30 minutes at 65°C with 2x SSC, 
0.1% SDS; followed by three washes of 40 minutes at 65°C with 0.2x SSC. 1 % SDS, 
the membranes were visualised by overnight exposure at -80*0 with Kodak MR X-ray 
film. 

20 

EXAMPLE 8 

Expression of wheat Sgp-1 polypeptides in the wheat endosperm 

The development and use of monoclonal antibodies to the Sgp-1 proteins has been 
described previously (Rahman et ai, 1995). These antibodies were used by the 
25 present inventors to characterise the expression and localisation of the Sgp-1 proteins. 

The proteins found in the matrix of the wheat starch granule are shown in Figure 1 , 
lanel . The remaining lanes show an immunobiot of proteins from the soluble phase 
(Figure 1; lanes 2-4) and the starch granule (Figure 1; lanes 5-7), respectively, 
30 following SDS-PAGE. In addition to cross-reactivity with the 100-105 kDa proteins, a 
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weak cross-reaction with a 50 kDa protein in both the granule and the soluble fractions 
were observed (Figure 1). The Sgp-1 polypeptides are present in the starch granule 
throughout endosperm development (Figure 1; lanes 5-7, also see Rahman et al., 
1995). However, as the endosperms matures, there is a reduction in the amount of 
5 Sgp-1 protein found in the soluble fraction. Lane 4 shows that by 25 days after 
anthesis, the level of these proteins in the soluble fraction is substantially reduced. 
This observation is consistent with previous results from Rahman etaL, (1995), who 
suggested that the Sgp-1 proteins were exclusively granule bound based on studies 
of granules from endosperm in mid-late stages endosperm development, however, 
10 these results suggest that the partitioning of these proteins between the granule and 
the soluble phase changes during development. 

EXAMPLE 9 

Isolation of cDNA clones encoding wheat starch synthase 11 (wSSII) proteins 

15 Monoclonal antibodies against Sgp-1 polypeptides (Rahman ef a/., 1995) were used 
to probe the expression library described in Example 4 (i.e. the first cDNA library). 
Three immunoreactive plaques were identified and sequenced. One clone, designated 
wSSIIpl, contained an 85-bp cDNA insert with homology to maize SSIIa (Harn et aL, 
1998). 

20 

DNA from the wSSIIpl clone was used as a probe in the hybridisation screening of the 
second cDNA library, prepared from Triticum aestivum cultivar\Afyuna endosperm RNA 
as described in Example 4. Ten hybridising cONA clones were selected and 
sequenced. On the basis of the DNA sequences obtained, the 10 cDNA clones can be 
25 classified into three groups. Group 1 contains 7 cDNA clones, group 2 contains 2 
cDNA clones and group 3 contains 1 cDNA clone. 

The longest clone from group 1 (designated wSSIIB) is 2939 bp in length (SEQ ID 
NO:<400>1) and encodes a 798 -amino acid polypeptide starting at nucleotide 176 
30 and terminating at nucleotide 2572 (SEQ ID NO:<400>2). 
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The longest clone from group 2 (designated wSSIIA) is 2807 bp in length (SEQ ID 
NO:<400>3) and encodes a 799 -amino acid polypeptide starting at nucleotide 89 and 
terminating at nucleotide 2488 (SEQ ID NO:<400>4). 

5 The cDNA from group 3 is a partial cDNA clone (designated wSSIID), which is 2107 
bp in length (SEQ ID NO:<400>5) and encodes a 597 -amino acid polypeptide starting 
at nucleotide 1 and terminating at nucleotide 1794 (SEQ ID NO:<400>6). The encoded 
polypeptide is approximately a 200 amino acid residues shorter than that of 
polypeptides encoded by longest clones of group 1 or 2 clones, respectively (Figure 
10 2). 

Comparison of the three cDNA clones, wSSIIB, wSSIIA and wSSIID shows that they 
share 95.7% to 96.6% identity at amino acid level, with variation at 44 amino acid 
positions between the three sequences (Figure 3). Of the 44 amino acid changes 

15 between these sequences, 31 changes occur in the N-terminal region (residues 1 to 
300), 10 changes occur in the central region (residues 301 to 729) and 3 changes 
occur in the C-terminal region (residues 730 to 799). The wSSIIA polypeptide (799 
amino acid residues) and wSSIIB polypeptide (798 amino acid residues) sequences 
differ in length by a single amino acid residue, due to the deletion of Asp-69 from the 

20 wSSIIB polypeptidesequence. 

A comparison of the nucleotide sequences of the wSSIA, wSSIIB and wSSIID cDNA 
clones with the nucleotide sequence of the wSSIIpl cDNA obtained by 
immunoscreening confirms that the wSSIIpl sequence is found in each cDNA. The 

25 peptide encoded by the wSSIIpl cDNA clone corresponds to amino acid residues in 
the region from residue 272 to residue 298 of the wSSIIA polypeptide, and to amino 
acid residues in the region from residue 271 to residue 297 of the wSSIIB polypeptide 
(see Figure 3). Thus, the peptide epitope encoded by wSSIIpl that reacts with the 
anti-Sgp-1 monoclonal antibodies can therefore be localised to this region of the 

30 wSSIIA and wSSIIB polypeptides and to the corresponding region of the wSSIID 
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polypeptide. 

Notwithstanding that a region having about 63% amino acid sequence identity to the 
peptide epitope encoded by clone wSSIIpl is found in the maize SSIIa polypeptide 
5 (Figure 3), the degree of amino acid conservation between maize and wheat 
sequences in this region of the polypeptide is insufficient for immunological cross- 
reactivity to occur between these species using the monoclonal antibodies to the 
wheat Sgp-1 proteins described by Rahman et aL (1995). Additionally, this peptide 
epitope is not found in granule-bound starch synthases, SSI, or SSIII (data not shown). 

10 

The wSSIIB cDNA (SEQ ID NO:<400>1) encodes an amino acid sequence comprising 
the peptide motif AAGKKDAGID (SEQ ID NO:<400>18) between residues 60 and 69 
of SEQ ID NO:<400>2 (Figure 3) which, with the exception of the second residue, is 
identical to the N-terminal of the 100 kDa (AAGKKDAGID: SEQ ID NOs:<400>19 and 
15 20) protein (Sgp-B1) from the wheat starch granule (note that the sequence given in 
Rahman etaL, 1995 (AVlGKKDAL: SEQ ID NOs:<400>21 and 22 ) has been revised 
following further amino acid sequence analysis). 

The wSSIIA cDNA clone (SEQ ID NO:<400>3) encodes an amino acid sequence 
20 comprising the peptide motif AAGKKDARVDDAA (SEQ ID NO: <400>23) at residues 
60 to 73 of SEQ ID NO:<400>4, which is about 66% identical to the N-terminal amino 
acid sequence (i.e. ALGKKDAGIVDGA: SEQ ID NO: <400>24) of the 104 kDa and 
105 kDa starch granule proteins, Sgp-DI and Sgp-AI respectively, as determined by 
sequence analysis of isolated protein (Rahman ef a/., 1995). 

25 

Furthermore, Takaoka ef a/. (1997) reported the amino acid sequences of 3 
polypeptides obtained from sequencing starch granule proteins derived from the Sgp-1 
proteins. Peptide 3 described by Takaoka et aL (1997) corresponds to amino acid 
residues 378 to 387 of the amino acid sequence of the wSSIIA cDNA (SEQ ID 
30 NO:<400>4; Figure 3). Peptides 1 and 2 described by Takaoka etaL (1997) could not 
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be detected in the amino acid sequences of the wSSII cDNA clones of the present 
invention, however peptide 1 of Takaoka etaL (1997) can be found in the amino acid 
sequences of SSI from maize, rice, wheat and potato (data not shown). 

5 Denyer et al. (1995) demonstrated that the Sgp-1 proteins possess starch synthase 
activity and, as a consequence, the wSSIIB, wSSIA and wSSIID cDNA clones encode 
starch synthase enzymes that are differentially expressed in a developmentally- 
regulated manner in both the soluble and granule-bound fractions of the endosperm 
(Figure 1), Based on the nomenclature suggested by Harn et ai (1998). it is 
10 appropriate to describe the Sgp-1 proteins as "starch synthases" rather than "granule- 
bound starch synthases". 



EXAMPLE 10 

IS Analysis of wheat starch synthase II mRNA expression 

The mRNA for wheat starch synthase II could be detected in leaves, pre-anthesis 
florets and endosperm of wheat when total RNAs isolated from these tissue were 
probed with a PGR probe, wSSIIp2, corresponding to nucleotide positions 1435 to 
1835 bp of wSSIIB-cDNA (SEQ ID NO:<400>1; Figure 4). Unlike wSSI. which could 

20 not be detected in wheat leaves derived from plants grown under the same conditions, 
wSSII genes are highly-expressed in the leaves (Figure 4, lane 1), and expressed at 
an intermediate level In pre-anthesis florets (Figure 4, lane 2), and at much lower levels 
in developing wheat endosperm cells (Figure 4, lanes 3-11). In contrast, the maize 
SSIIa is expressed predominantly in the endosperm, whilst the maize SSIIb is detected 

25 mainly in the leaf, albeit at low levels (Harn et aL, 1998). 

The wSSII mRNA was detectable in the endosperm 6 days after anthesis and mRNA 
levels increase between 8 and 18 days post-anthesis, after which time levels of mRNA 
decline. 
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Southern blotting experiments in wheat demonstrated that the wSSIIp2 probe used 
detected only a single copy of the SSII gene in each genome (data not shown). Thus, 
it is unlikely that this probe cross-hybridised with mRNAs encoded by genes other than 
wSSII. 

5 

EXAMPLE 11 

Chromosomal localization of the wheat wSSII genes. 

10 I. Amplification of specific cDNA regions of wheat starch synthase 11 using PCR 

Two PCR products. wSSIIp2 and wSSIIp3 were amplified from the cDNA clone wSSIIB 
and used for the northern hybridisation and Southern hybridisation, respectively. 

The primers sslla (5' TGTTGAGGTTCCATGGCACGTTC 3': SEQ ID NO: <400>26) 
15 and ssllb (5' AGTCGTTCTGCCGTATGATGTCG 3': SEQ ID NO: <400>26) were used 
to amplify the cDNA fragment wSSIIp2 (i.e. nucleotide positions 1435 to 1 835 of SEQ 
ID NO:<400>1). 

The primers ssllc (5' CCAAGTACCAGTGGTGAACGC 3': SEQ ID NO: <400>27) and 
20 sslld (5' CGGTGGGATCCAACGGCCC 3': SEQ ID NO: <400>28) were used to amplify 
the cDNA fragment wSSIIpS (i.e. nucleotide positions 2556 to 2921 of SEQ ID 
NO:<400>1). 

The amplification reactions were performed using a FTS-1 thermal sequencer (Corbett, 
25 Australia) for 1 cycle of 95^C for 2 minutes; 35 cycles of 95^=^0 for 30 seconds, 60°C for 
1 minutes, 72*^0 for 2 minutes and 1 cycle of 25°C for 1 minute. 

II. PCR and nucleotide sequence analysis of 3' sequences of wheat SSII genes 
Genomic DNA was extracted from wild-type Chinese Spring wheat, and from three 
30 nullisomic-tetrasomic lines of chromosome 7 of Chinese Spring wheat, and from 
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Triticum tauschii (var strangulata, accession number CPI 100799), and used as a 
template for the amplification and nucleotide sequence analysis of wheat SSI I genes. 

RFLP analysis oi BamHl and EcoRI restricted DNAfrom each wheat or T. Tauschii line 
5 was carried out using the wSSIIp3 fragment as a probe. Three hybridising bands were 
obtained which could be assigned to chromosomes 7A, 7B and 7D, respectively (data 
not shown). This analysis Indicates that there Is a single copy of the wSSII gene in 
each genome In hexaploid wheat, consistent with the findings of Yamamori and Endo 
(1 996) who located the SGP-A1 , B1 and D1 proteins to the short arm of chromosome 
10 7. 

PGR analysis was used to assign each of the cDNA clones to the Individual wheat 
genomes. A single 365 bp PGR fragment was obtained from nullisomic-tetrasomic 
genomic DNA of Ghinese Spring when primers ssllc and sslld were used for the PGR 

15 amplification (Figure 5, right panel). This PGR product is obtained only from lines 
bearing the B genome. The fragment was cloned and sequenced and shown to be 
identical to a 365 bp region of the wSSIIB cDNA. An identical fragment is obtained by 
PGR amplification of the wSSIIB cDNA clone, but not by amplification of the wSSIIA 
or wSSIID clones, supporting the conclusion that the wSSIIB cDNA is the product of 

20 a gene located on chromosome 7 of the B genome of hexaploid wheat. 

Two PGR products were also amplified from nullisomic-tetrasomic genomic DNA of 
Ghinese Spring using the primers ssllc and sslle (Figure 5, left panel). One PGR 
fragment, approximately 350 bp is only amplified when the A genome is present, and 
25 a second 322 bp product is only amplified when the D-genome is present. The 350 and 
322 bp PGR products were also cloned and sequenced and shown to be identical to 
the wSSIIA and wSSIID cDNAs, respectively, supporting the conclusion that the 
wSSIIA and wSSIID cDNAs are the products of genes located on chromosomes 7A 
and 7D, respectively. 

30 
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EXAMPLE 12 
Isolation of genomic wSSII clones 

Screening of a genomic library from the D-genome donor of wheat, T. tauschii, was 
performed as described in Example 5, using the the PCR-derived DNA fragment 
5 wSSIIp2 as a hybridisation probe. Figure 6 shows an example of a plaque lift showing 
the positive-hybridising clone wSSII-8, a putative T. fauscM homologue of the wSSII 
gene. 

Positive-hybridising plaques were digested using the restriction enzyme BamHI, 
10 separated on a 1% agarose gel, transferred to nitrocellulose membrane and hybridised 
to probe wSS!lp4 comprising nucleotides 1 to 367 of the wSSIIA cDNA clone, using 
the conditions described by Rahman etaL (1997). As shown in Figure 7,clone wSSII-8 
also hybridises strongly to this probe, confirming its identity as a genomic wSSII gene. 
Furthermore, in light of the fact that the wSSIIp4 probe comprises the 5'-end of the 
15 cDNA clone, it is likely that genomic clone wSSII-8 comprises the promoter region of 
the wSSII gene. 
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EXAMPLE 13 

Cloning of specific cDNA regions of wheat starch synthase ill using RT-PCR 

PGR primers were used to amplify sequences of starch synthase III from wheat 
endosperm cDNA. The design of PGR primers was based on the sequences of starch 
5 synthase III from potato and the du1 starch synthase III gene of maize. 

First-strand cDNAs were syntheslsed from 1 fxg of total RNA (derived from endospemn 
of the cultivar Rosella, 12 days after anthesis) as described by Maniatis et ai (1982), 
and then used as templates to amplify two specific cDNA regions, wSSIIIpl and 
10 wSSIIIp2, of wheat starch synthase III by PGR. 

The primers used to obtain the cDNA clone wSSIIIpl were as follows: 

Primer wSS3pa (5' GGAGGTGTTGGTGATGTTGT 3': SEQ ID NO: <400>29); and 

Primer wSSSpb (5' CTTGACCAATCATGGCAATG 3': SEQ ID NO: <400>30). 

15 

The primers used to obtain the cDNA clone wSSIIIp2 were as follows: 

Primer wSS3pc (5' CATTGCCATGATTGGTCAAG 3': SEQ ID NO: <400>31); and 

Primer wSS3pd (5* ACCACCTGTCCGTTCCGTTGG 3': SEQ ID NO: <400>32). 

20 The amplified clones wSSIIIpl and wSSIIIp2 were used as probes to screen the third 
cDNA library and T. tauschii genomic DNA library as described in Example 4. 

A further probe designated wSSIIIp3 was used for screening the third cDNA library, as 
described in Example 4. Probe wSSIIIp3 was amplified by PGR from a cDNA clone 
25 produced from the first screening using the following amplification primers: 

Primer wSS3pe (5' GCACGGTCTATGAGAACAATGGC 3': SEQ ID NO:<400>33); and 
Primer wSSSpf (5' TCTGCATACCACCAATCGCCG 3': SEQ ID NO: <400>34). 

The amplification reactions were performed using a FTS-1 or FTS4000 thermal 
30 sequencer (Corbett. Australia) for 1 cycle of 95°C for 2 minutes; 35 cycles of 95°C for 
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30 seconds, 60°C for 1 minutes, ll^'C for 2 minutes and 1 cycle of 25^*0 for 1 minute. 

Amplified sequences of the expected length were obtained, cloned and sequenced, 
and shown to contain DNA sequences highly homologous to the maize and potato 
5 SSIII genes. PGR fragments were subsequently used to probe a wheat cDNA library 
by DNA hybridisation and 8 positive clones were obtained, including one 3 kb cDNA. 
A region from the 5' end of this cDNA was amplified by PGR and used a probe for a 
second round of screening the cDNA library, obtaining 8 cDNA clones. Of these, one 
cDNA was demonstrated to be full length (wSSIII.BS, 5.36 kb insert). The sequence 
10 of the 5.36 kb wSSIII.B3 cDNA clone is given in SEQ ID NO:<400>7. 

Sequencing of the 8 cDNA clones obtained from the second round screening of the 
wheat cDNA library revealed that there were at least 2 classes of cDNA encoding SSIII 
present, possibly being encoded by homeologous genes on different wheat genomes. 
15 The sequence of a representative of this second class of cDNA clones, wSSIII.BI , is 
shown in SEQ ID NO:<400>9. The 3664 bp clone wSSIII.BI is not full length, spanning 
only the region from nucleotides 1690 to 5363 of the homeologous clone wSSIII.B3, 
with an open reading frame between nucleotide positions 1 and 3180. 

20 An open reading frame is found between the ATG translation start codon at position 
29 and the stop codon at position 4921 of the cDNA clone wSSIII.BS (SEQ ID 
NO:<400>7). The amino acid sequence deduced from this open reading frame is 
shown in SEQ ID NO:<400>8. 

25 An alignment of the deduced amino acid sequences of SSIII from maize, potato and 
wheat is shown in Figure 8. There is 56.6% identity between the maize SSIII and 
wheat WSSIII.B3 sequences at the amino acid level. 

The G-terminal domain of starch synthases comprise the catalytic domain, and a 
30 characteristic amino acid sequence motif KVGGLGDWTSLSRAVQDLGHNVEV (SEQ 
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ID NO:<400> 35) in maize, or alternatively KVGGLGDWTSLSRAIQDLGHTVEV (SEQ 
ID NO: <400>36) in wheat, marking the first conserved region in the C-terminal 
domain. 

5 The amino acid identity between maize dulM and wSSIIl.BS in the N-terminal region 
(i.e. amino acids 1 to 600 in Figure 8) is only 32.2%; whilst the amino acid identity in 
the central region (i.e. amino acids 601 to 1248 in Figure 8) is 68.4%; and in the C- 
terminal region (i.e. amino acids 1249 to 1631 in Figure 8) is 84.6%. Accordingly, the 
SSIII starch synthases are much more highly conserved between maize and wheat in 
10 the region comprising the catalytic domain of the proteins. 

EXAMPLE 14 
Isolation of genomic clones for SSIII 

Screening of a genomic library from the D-genome donor of wheat, 7. tauschii, 
15 identified a number of clones which hybridised to the wSSIII PGR fragment. Positive 
plaques in the genomic library were selected as those hybridising with a probe that had 
been generated by PGR (amplifying between nucleotide positions 3620 to 3966) from 
the SSIII cDNA as template. The primer sequences used were as follows: 
wSS3pa (5' GGAGGTCTTGGTGATGTTGT 3': SEQ ID NO: <400>29); and 
20 wSS3pb (5' GTTGAGGAATGATGGGAATG 3' : SEQ ID NO: <400>30). 

Hybridisation was carried out in 25% formamide, 6 x SSC, 0.1% SDS at 42 °G for 16 
hour, then washed three times with 2 x SSG containing 0.1% SDS at 65 °C, for 1 hour 
per wash. Figure 9 shows an example of a plaque lift showing positive and negative 
25 hybridisations for plaques containing the T. tauschii homologue of the wSSIII.B3 gene. 

DNA was isolated from positive-hybridising A clones using methods described by 
Maniatis et ai Briefly, DNA was digested using BamHI or Bg/I and sub-cloned in to the 
vector pJKKmfm. DNA sequencing was performed using the automated ABI system 
30 with dye terminators as described by the manufacturers. DNA sequences were 
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analysed using the GCG suite of programs (Devereaux etaL, 1984). 

Nucleotide equences of the genomic SSIII clone from F. tauschii are provided 
gerein as 6 contiguous sequences designated fragments 1 to 6 (SEQ ID 
5 NOs:<400>1 1 to <400>16, respectively). Table 3 defines the relative positions of 
these fragments with respect to the SSIII cDNA and describes the positions of 
exons. Figure 1 shows this information schematically. 



10 EXAMPLE 15 

Analysis of wheat starch synthase III mRNA expression 

Figure 10 shows the expression of wSSIII mRNA during endosperm development in 
two wheat varieties grown under defined environmental conditions. The expression 
of the gene is seen very early in endosperm development in both cultivars, 4 days 
15 after anthesis (Figure 10, panels a and b). Expression in the leaf of the variety 
Gabo is very weak (Figure 10, panel c, Lane L) whereas strong expression is seen 
in pre-anthesis florets (Figure 10, panel c, Lane P). 
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EXAMPLE 16 
Amino acid sequence comparisons between 
wheat SSII and SSIII polypeptides 

5 Amino acid sequence comparisons between wheat BSSS, SSI, SSII and SSIII 
polypeptides, as indicated in Figure 13, reveals eight highly-conserved domains. 
The amino acid sequences of these domains are represented in the wheat SSIII 
amino acid sequence by the following sequence motifs: 





(a) Regioni: 


KVGGLGDWTS; 


10 


(b) Region 2: 


GHTVEVILPKY; 




(c) Region 3: 


HDWSSAPVAWLYKEHY; 




(d) Region 4: 


GILNGIDPDiWDPYTD; 




(e) Region 5: 


DVPIVGIITRLTAQKG; 




(f) Region 5a: 


NGQWLLGSA; 


15 


(g) Region 6: 


AGSDFIIVPSIFEPCGLTQLVAMRYGS; and 




{h) Region 7: 


TGGLVDTV. 



As shown in Table 4 below, there is at least about 25% amino acid sequence 
identity, preferably at least about 30% amino acid sequence identity, more 

20 preferably at least about 35% amino acid sequence identity, more preferably at 
least about 40% amino acid sequence identity, more preferably at least about 45% 
amino acid sequence identity, more preferably at least about 50% amino acid 
sequence identity, more preferably at least about 55% amino acid sequence 
identity, more preferably at least about 60% amino acid sequence identity, more 

25 preferably at least about 65% amino acid sequence identity, more preferably at 
least about 70% amino acid sequence identity, more preferably at least about 75% 
amino acid sequence identity, more preferably at least about 80% amino acid 
sequence identity, more preferably at least about 85% amino acid sequence 
identity, more preferably at least about 90% amino acid sequence identity and 

30 even more preferably at least about 95% amino acid sequence identity between the 
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amino acid sequences of plant starch synthase enzymes, in particular wheat starch 
synthases. 

TABLE 4 



5 Identities between conserved motifs of plant starch synthases 



Conserved Region 


Number of conserved 
residues between wheat 
starcli synthases 


Number of conserved 
residues between 
wheat SSII and SSIII 
polypeptides 


Region 1: 
KVGGLGDWTS 


6/11 residues 


6/11 residues 


Region 2: 

GHTVEVILPKY 


6/11 residues 


6/11 residues 


Region 3: 

HDWSSAPVAWLYKEHY 


4/16 residues 


5/16 residues 


Region 4: 

GILNGIDPDIWDPYTD 


7/16 residues 


8/16 residues 


Region 5: 

DVPIVGIITRLTAQKG 


8/16 residues 


10/16 residues 


Region 5a: 
NGQWLLGSA 


4/10 residues 


4/10 residues 


Region 6: 

AGSDFIIVPSIFEPCGLT 
QLVAIVIRYGS 


15/27 residues 


17/27 residues 


Region 7: 

TGGLVDTV 


5/9 residues 


5/9 residues 



25 

The most conserved regions of the wheat SSII and SSIII polypeptides are a region 
of 6 or 7 identical amino acids in Region 1 (Table 4; Figure 13) and a region of 8 or 
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9 identical amino acids in Region 6 (Table 4; Figure 13). The lowest regions of 
identity are found in regions 3 and 5a. 

5 EXAMPLE 17 

Discussion 

Early work on the Sgp-1 starch synthase proteins (Denyer et al., 1995; Rahman ef 
a/., 1995) was based on the localisation of these proteins in the wheat starch 
granule, and no definitive conclusion concerning their presence or absence in 

10 soluble extracts of the wheat endosperm was presented. We have now 

demonstrated that a monoclonal antibody against the Sgp-1 proteins cross reacts 
strongly with those starch synthase proteins having apparent molecular weights of 
100-105 kDa in soluble extracts, however, the appearance of these proteins in 
soluble extracts is dependant on the developmental stage of the endosperm 

15 material. Whilst the proteins can be detected in the soluble phase in early to mid 
endosperm development, little or no soluble protein remains in late endosperm 
development (Figure 1). This observation accounts for the failure of Rahman et ai 
(1995) to detect the protein in soluble extracts in a previous report. 

20 Based upon the localisation of the Sgp-1 starch synthase proteins in the wheat 
endosperm, the following nomenclature is suggested for wheat starch synthase 
enzymes: wGBSS for the 60 kDa granule bound starch synthase (Wx); wSSI for the 
75 kDa starch synthase I (Sgp-3); wSSII for the 100 - 105 kDa proteins (Sgp-1); 
and wSSIII for a soluble high molecular starch synthase. 

25 

The present invention provides cDNA clones encoding the wSSII and wSSIII 
polypeptides and the corresponding genomic clones. 

The wSSIII cDNA clone described herein is clearly related to the maize and potato 
30 SSIII polypeptides. 
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Comparison of the amino acid sequences of all available starch synthases with the 
deduced amino acid sequences of the three wSSII cDNA clones of the present 
invention (i.e. wSSIIB, wSSilA and wSSilD) was conducted using PILEUP analysis 
(Devereayx et al., 1984) and data are presented as a dendrogram (Figure 11). The 
5 sequence of the glycogen synthase of E. coli was also included. Based upon their 
amino acid similarities, four classes of plant starch synthases can be defined: 
GBSS. SSI, SSII and SSIII. 

Based upon sequence identities and the function of the Sgp-1 proteins in wheat, 
10 the wSSIIB, wSSIIA and wSSID cDNA clones are members of the starch synthase 
II (SSII) group and are more similar in sequence to maize SSI la than maize SSIIb. 
Table 5 shows that levels of identity at the amino acid level between the wSSII 
sequences, as determined using the BESTFIT programme in GCG (Devereaux et 
al,, 1984), and other class II starch synthases range from 70% identity with potato 
IS SSII to 85% identity with maize SSIIa. Both wSSliB and wSSilD showed 
significantly higher homology to maize SSIIa than wSSIIA. 



TABLE 5 





wSSII-A 


wSSII-B 


wSSII-D 


wSSi-A 


100% 






wSSil-B 


95.9% 


100% 




wSSII-D 


96.3% 


96.7% 


100% 


maize SSIIa 


76.1% 


85.2% 


84.7% 


maize SSIIb 


76.3% 


76.7% 


75.9% 


pea SSII 


72.0% 


72.2% 


71.8% 


potato SSII 


70.9% 


71.1% 


70.3% 



Whilst the evidence is compelling that the wSSIIA, wSSIIB and wSSIID cDNAs 
encode the Sgp-AI, Sgp-BI and Sgp-D1 proteins of the wheat starch granule, 
30 molecular weights calculated from the deduced amino acid sequences of the clones 
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are considerably lower than estimates obtained from SDS-PAGE. The molecular 
weight of the precursor wSSIIA protein is 87,229 Da, and the mature protein 81,164 
Da, yet the estimated molecular weight in our experience is 105 kDa. The 
assignment of the wSSIIA cDNA to the A-genome of wheat is demonstrated in 
5 Figure 5, and the assignement of the 105 kDa protein to the A-genome in Denyer et 
al. (1995) and Yamamori and Endo (1996). Similarly, the molecular weight of the 
wSSIIB protein is 86,790 Da and the mature protein 80,759 Da. yet the molecular 
weight of the Sgp-BI protein is estimated to be 100 kDa. No comparison can be 
made of the wSSIID sequences as a full length cDNA clone was not obtained. The 

10 wSSIIA and wSSIIB amino acid sequences differ by just a single amino acid 
residue, yet there is an apparent difference of 5 kDa in molecular weight when 
estimated by SDS-PAGE. Several possibilities can be advanced to account for this 
apparent discrepancy in molecular weights. Firstly, the wSSII proteins may not 
migrate in SDS-PAGE in accordance with their molecular weight because they 

15 retain some conformation under the denaturing conditions used. Secondly, the 
proteins may be glycosylated. It is also possible that the proteins may be non- 
covalently linked to starch through a high affinity starch binding site which survives 
denaturation and SDS-PAGE. Differences between the apparent molecular weights 
and those calculated from the deduced amino acid sequences will have to be 

20 defined in establishing the relationship between the wSSII proteins and proteins 
encoded by the analogous SSil genes of other species. 

The catalytic domain of the starch synthases is found at the C-terminal end of the 
protein (Gao ef a/., 1998; Harn etai, 1998). Harn etaL (1998) identified 7 
25 conserved regions among SSIIa, SSIIb, SSI and GBSS sequences. 

We have identified include an additional conserved region (designated region 5a in 
Figure 3 and Figure 12) comprising the amino acid sequence motif DVQLVMLGTG. 
Comparison of the wSSII sequences of the present invention with differing isoforms 
30 of starch synthases (GBSS. SS1. SSI! and SSIII) identified a total of 8 regions of 
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the deduced amino acid sequences which were conserved amongst starch 
synthases from each class. Figure 12 shows an alignment of plant starch synthase 
sequences, in which the position of the first homologous region is used as the basis 
of the alignment. This first homologous region contains the consensus motif KXGG 
5 which is believed to be present in the ADPglucose binding site of starch and 
glycogen synthases (Furukawa ef a/., 1990). 

The conservation of eight peptide regions among the 4 classes of starch synthases 
is striking, in terms of their sequence homologies and their alignment. The major 

10 differences in structure between the classes of genes are found in the length of the 
N-terminal region between the transit peptide and the first conserved region. At one 
extreme, the GBSS genes have a very short N-terminal arm, whereas the du1 
starch synthase contains a very long N-terminal extension containing several 
distinct regions (Figure 12). The wSSII genes contain an N-terminal extension 

15 which is longer than either GBSS.SSI, or SSIIb, and slightly longer than the maize 
SSIIa gene (Figure 12). Analysis of the wheat SSII genes shows that there is a 
motif, PVNGENK, which is repeated. The area surrounding the repeated 
PVNGENK motif Is not homologous to maize SSIIa and the insertion of this region 
is responsible for the difference in length between the wheat SSII and maize SSIIa 

20 genes. In pea and potato SSII polypeptides, a PPP motif (Figure 3; residues 251- 
253 and 287-289 respectively) has been suggested to mark the end of the N- 
terminal region and to facilitate the flexibility of an "N-terminal arm". This motif is not 
found in either the maize or wheat SSII sequences. 

25 The generation of a wheat line combining null alleles at each of the three wSSII 
loci, wSSIIA, wSStIB and wSSIID, has been reported recently by Yamamori (1998). 
In this triple null line, the large starch granules were reported to be mostly deformed 
and a novel starch with high blue value was observed when stained with iodine, 
indicating that wSSII is a key enzyme for the synthesis of starch in wheat. Further 

30 analysis of the starch derived from this triple null mutant is in progress. 
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Mutations in starch synthases are known in three other species. In pea, mutation in 
SSII gives rise to starch with altered granule morphology and an amylopectin which 
yields an oligosaccharide distribution with reduced chain length on debranching, 
compared to the wild type (Craig et aL, 1998). A similar mutation in a gene 
5 designated SSII is known in Chlamydomonas (the sta-3 mutation) and similar 
effects on granule morphology and amylopectin structure are observed (Fontaine et 
ai, 1993). In maize, two mutations affecting starch synthases are known. First, the 
duZ/tmutation has been shown to be caused by a lesion within the ofu1 SSIII-type 
starch synthase gene (Gao ef a/., 1998). A second mutation, the sugary-2 mutation 
10 yields a starch with reduced amylopectin chain lengths on debranching (this 
mutation co-segregates with the SSI la locus (Harn et aL, 1998) although direct 
evidence that the sugary-l mutation is caused by a lesion in the SSIIa gene is 
lacking). In the SSII mutants of each of these species, amylose biosynthesis 
capacity is retained, suggesting different roles in amylose and amylopectin 
15 synthesis for the GBSS and SSII genes. Given the conservation in overall 
organisation of the GBSS and SSII genes (see Figures 11 and 12). when an 
alignment is made based on the KTGGL motif of the first consen/ed region, this 
focuses attention on the role(s) of the N-terminal region in defining substrate 
specificity and the localisation of the proteins as the N-terminal region is the major 
20 area of divergence between the 4 classes of starch synthases. However, it is 
premature to exclude the influence of more subtle mutations in central and C- 
terminal regions of the gene. 

The cloning of the wSSII and wSSIII cDNAs and genomic clones described herein 
provides useful tools for the further study of the roles of the starch synthases in 
wheat. Firstly, they provide a source of markers which can be used to recover and 
combine null or divergent alleles. Secondly, genetic manipulation of wheat by gene 
suppression or over-expression can be carried out, and the genes may be used for 
overexpression in other species. The promoter regions of these genes are also 
useful in regulating the expression of starch synthase genes and other 
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heterologous genes in the developing wheat endosperm and in pre-anthesis florets 
of wheat. 



P:\OPER\MRO\PI-WSS.PRV - 29/4/99 



-78. 

REFERENCES 

1. Ausubel. F. M., Brent, R.. Kingston, RE, Moore, D.D., Seidman, J.G., Smith, 
J.A., and Struhl, K. (1987). In: Current Protocols in Molecular Biology. Wiley 
Interscience (ISBN 047150338), 
5 2. Abel GJW, Springer F, Wiilmitzer L, Kossmann J (1996) Cloning and 
functional analysis of a cDNA encoding a novel 139 kDa starch synthase from 
potato (Solanum tuberosum L.). Plant J 10: 981-991. 

3. Ainsworlh C, Clark J, Balsdon J (1993) Expression, organisation and 
structure of the genes encoding the waxy protein (granule-bound starch synthase) 

10 in wheat. Plant Mol Biol 22: 67-82. 

4. Baba T, Nishihara M, Mizuno K, Kawasaki T, Shimada H, Kobayabashi E, 
Ohnishi S, Tanaka K, Arai Y (1993) Identification, cDNA cloning, and Gene 
Expression of Soluble Starch Synthase in Rice {Oryza sativa L.) Immature Seeds. 
Plant Physiol 103: 565-673. 

15 5. Craig J, Lloyd JR, Tomlinson K, Barber L, Edwards A, Wang TL, Martin C, 
Hedley CL, Smith AM (1998) Mutations in the gene encoding starch synthase II 
profoundly alter amylopectin structure in pea embryos. Plant Cell 10: 413-426. 

6. Denyer K, Hylton CM, Jenner CF, Smith AM (1995) Identification of multiple 
isoforms of soluble and granule-bound starch synthase in developing wheat 

20 endosperm. Planta 196: 256-265. 

7. Devereaux, J, Haeberii P, Smithies O (1984) A comprehensive set of 
sequence analysis programs for the VAX. Nucleic Acids Res 12: 387-395. 

8. Dry I, Smith A, Edwards A, Bhattacharyya M, Dunn P, Martin C (1992) 
25 Characterisation of cDNAs encoding two isoforms of granule-bound starch 

synthase which show differential expression in developing storage organ of pea 
and potato. Plant J 2: 193-202. 

9. Edwards A, Marshall J. Sidebottom C, Visser RGF, Smith AM. Martin C 
(1995) Biochemical and molecular characterization of a novel starch synthase from 

30 potato tubers. Plant J 8: 283-294. 



P:\OPER\MRO\PI-WSS.PRV - 29A 



-79- 

10. Fontaine T, D'Hulst C, Maddelein M-L. Routier F, Pepin TM, Decq A, 
WieruszeskI J-M, Delrue B, Van den Koornhuyse N, Bossu J-P, Fournet B, Ball S 
(1993) Toward an understanding of the biogenesis of the starch granule. Evidence 
that Chlamydomonas soluble starch synthase II controls the synthesis of 

5 intermediate size glucans of amylopectin. J Biol Chem 22: 16223-16230. 

1 1 . Furukawa K, Tagaya M, Inouye M. Preiss J, Fukui T (1990) Identification of 
lysine 15 at the active site in Escherichia co// glycogen synthase. J Biol Chem 265: 
2086-2090. 

10 

12. Gao M, Wanat J, Stinard PS, James MG, Myers AM (1998) Characterization 
of dulH, a maize gene coding for a novel starch synthase. Plant Cell 10: 399-412. 

13. Harn C, Knight M, Ramakrishnan A, Guan H, Keeling PL, Wasserman BP 
15 (1998) Isolation and characterization of the zSSIIa and zSSIIb starch synthase 

cDNA clones from maize endosperm. Plant Mol Biol 37: 639-649. 

14. Kloesgen RB, Gierl A, Schwarz-Sommer ZS, Saedler H (1986) Molecular 
analysis of the waxy locus of Zea mays. Mol Gen Genet 203: 237-244. 

20 

15. Knight ME, Harn C, Lilley CER, Guan H, Singletary G W, MuForster C, 
Wasserman BP, Keeling PL (1998) Molecular cloning of starch synthase I from 
maize (W64) endosperm and expression in Escherichia. Plant J 14: 613-622. 

25 16, Kumar A, Larsen CE. Preiss J (1986) Biosynthesis of bacterial glycogen: 
Primary structure of Escherichia co// ADP-glucose:alpha-1,4-glucan, 4- 
glucosyltransferase as deduced from the nucleotide sequence of the gIgA gene. J 
Biol Chem 261: 16256-16259. 

30 17. Li Z, Rahman S, Kosar-Hashemi B, Mouille G, Appels R Morell, MK (1999) 



P:\OPER\MRO\PI-WSS. PRV - 29/4/99 



-80- 

Cloning and characterisation of a gene encoding wheat starch synthase I. Theor 
AppI Genet: In press. 

18. Mouille G, Maddelein M-L, Libessart N, Talaga P, Decq A, Delrue B Ball, S 
5 (1996). Preamylopectin processing: A mandatory step for starch biosynthesis in 

plants. Plant Cell 8: 1353-1366. 

19. Nakamura T, Yamamori M, Hirano H, Hidaka S, Nagamine T (1995) 
Production of waxy (amylose-free) wheats. Mo! Gen Genet 248: 253-259. 

10 

20. Okagakl, RJ (1992) Nucleotide sequence of a long cDNA from the rice waxy 
gene. Plant Mol Biol 19: 513-516. 

21. Ozbun, J.L.. Hawker, J.S. and Preiss, J. (1971) Adensine diphosphoglucose- 
15 starch glucosyltransferases from developing kernels of waxy maize. Plant 

Physiology 48: 765-769 

22. Ozbun, J.L., Hawker, J.S., Greenberg, E., Lammel, C. Preiss, J. and Lee, 
E.Y.C.(1973) Starch synthetase, phosphorylase, ADPglucose pyrophosphorylase, 
and UDPglucose pyrophosphorylase in developing maize kernels. Plant Physiology 

20 51: 1-5. 

23. Pollock, C. and Preiss, J.(1980) The citrate-stimulated starch synthase of 
starchy maize kernels: purification and properties. Arch Biochem Biophys 204: 
578-588. 

24. Rahman S, Abrahams S, Abbott D. Mukai Y. Samuel M, Morell M. Appels R 
25 (1997) A complex arrangement of genes at a starch branching enzyme I locus in D- 

genome donor of wheat. Genome 40: 465-474. 

25. Rahman S, Kosar-Hashemi B, Samuel M, Hill A, Abbott DC, Skerritt JH, 
Preiss J, Appels R, Morell M (1995) The major proteins of wheat endosperm starch 
granules. Aust J Plant Physiol 22: 793-803. 



P:\OPER\MRO\PI-WSS.PRV - 29A 



-81 - 

26. Rahman S. Li Z. Abrahams S, Abbott D, Appels R, Morell M (1998) 
Characterisation of a gene encoding wheat endosperm starch branching enzyme-l. 
Theor AppI Genet 98: In press. 

27. Sears ER, Miller TG (1 985) The history of Chinese spring wheat. Cereal Res 
5 Comm 13: 261-263. 

28. Takaoka M, Watanabe S, Sassa H, Yamamori M, Nakamura T, Sasakuma T. 
Hirano H (1997) Structural characterisation of high molecular weight starch granule- 
bound proteins in wheat {Triticum aestivum L). J Agric Food Chem 45: 2929-2934. 

29. van der Leij FR, Visser RGF, Ponstein AS, Jacobsen E, Feenstra WJ (1991) 
10 Sequence of the structural gene for granule bound starch synthase of potato 

{Solanum tuberosum L.) and evidence for a single point deletion in the am/ allele. 
Mol Gen Genet 228: 240-248. 

30. Yamamori M, Endo TR (1 996) Variation of starch granule proteins and 
chromosome mapping of their coding genes in common wheat. Theor AppI Genet 

15 93:275-281. 

31 . Yamamori M (1 998) Selection of a wheat lacking a putative enzyme for 
starch synthesis, SGP-1 Pnoc 9"" tnt Wheat Gen Symp 4, 300-302. 



I * 

P:\OPER\MRO\PI-WSS.PRV - 29/4/99 



- 1 - 



SEQUENCE LISTING 
<110> COMMONWEALTH SCIENTIFIC AND INDUSTRIAL ORGANISATION 
<120> NOVEL GENES ENCODING WHEAT STARCH SYNTHASES AND USES THEREFOR 
<13 0> p:\oper\mro\pi-wss.prv 

<140> 
<141> 

<160> 36 

<170> PatentIn Ver. 2.0 

<210> 1 
<211> 2939 
<212> DNA 

<213> Triticum aestlvum 

<220> 
<221> CDS 

<222> (176) . . (2572) 
<400> 1 

atttcctcgg cctgaccccg tgcgtttacc ccacacagag cacactccag tccagtccag 60 

cccactgccg cgctactccc cactcccact gccaccacct ccgcctgcgc cgcgctctgg 12 0 

gcggaccaac ccgcgcatcg tatcacgatc acccaccccg atcccggccg ccgcc atg 178 

Met 
1 

teg teg gcg gtc gcg tec gee gcg tec ttc etc gcg etc geg tee gee 226 
Ser Ser Ala Val Ala Ser Ala Ala Ser Phe Leu Ala Leu Ala Ser Ala 
5 10 15 

tec ccc ggg aga tea egg agg agg acg agg gtg age gcg teg cca ccc 274 
Ser Pro Gly Arg Ser Arg Arg Arg Thr Arg Val Ser Ala Ser Pro Pro 
20 25 30 

cae ace ggg get ggc agg ttg cac tgg ccg cog teg ccg ccg cag cgc 322 
His Thr Gly Ala Gly Arg Leu His Trp Pro Pro Ser Pro Pro Gin Arg 
35 40 45 

acg get cgc gac gga gcg gtg gcc gcg cgc gee gee ggg aag aag gae 3 70 
Thr Ala Arg Asp Gly Ala Val Ala Ala Arg Ala Ala Gly Lys Lys Asp 
50 55 60 65 

gcg ggg ate gac gae gee geg ccc gcg agg cag ccc cgc gca etc cgc 418 
Ala Gly lie Asp Asp Ala Ala Pro Ala Arg Gin Pro Arg Ala Leu Arg 
70 75 80 

ggt ggc gee gcc ace aag gtt gcg gag egg agg gat ccc gtc aag acg 466 
Gly Gly Ala Ala Thr Lys Val Ala Glu Arg Arg Asp Pro Val Lys Thr 
85 90 95 

etc gat cgc gae gcc gcg gaa ggt ggc gcg ccg tec ccg ccg gca ccg 514 
Leu Asp Arg Asp Ala Ala Glu Gly Gly Ala Pro Ser Pro Pro Ala Pro 
100 105 110 



agg cag gag gac gcc cgt ctg ccg age atg aac ggc atg 
Arg Gin Glu Asp Ala Arg Leu Pro Ser Met Asn Gly Met 
115 120 125 



ccg gtg aac 
Pro Val Asn 



562 
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610 



ttc gaa cac cac cag aat cac gat tec ggg cct ttg gca ggg gag aac 
Phe Glu His His Gin Asn His Asp Ser Gly Pro Leu Ala Gly Glu Asn 
290 295 300 305 



gag gaa gcc tac gat gtc gga gtc cga aaa tac tac aag get get gga 
Glu Glu Ala Tyr Asp Val Gly Val Arg Lys Tyr Tyr Lys Ala Ala Gly 
355 360 365 



€58 



706 



ggt gaa aac aaa tet ace ggc ggc ggc ggc geg act aaa gac age ggg 
Gly Glu Asn Lys Ser Thr Gly Gly Gly Gly Ala Thr Lys Asp Ser Gly 
130 135 140 145 

ctg ccc gca ccc gca cgc gcg ccc cag ccg teg age cag aac aga gta 
Leu Pro Ala Pro Ala Arg Ala Pro Gin Pro Ser Ser Gin Asn Arg Val 
150 155 160 

ccg gtg aat ggt gaa aac aaa get aac gtc gcc teg ccg ccg acg age 
Pro Val Asn Gly Glu Asn Lys Ala Asn Val Ala Ser Pro Pro Thr Ser 
165 170 175 

ata gcc gag gtc gcg get ccg gat ccc gca get ace att tee ate agt 754 
He Ala Glu Val Ala Ala Pro Asp Pro Ala Ala Thr He Ser He Ser 
180 185 190 

gac aag gcg eca gag tec gtt gtc cea gee gag aag geg ccg ccg teg 
Asp Lys Ala Pro Glu Ser Val Val Pro Ala Glu Lys Ala Pro Pro Ser 
195 200 205 

tec ggc tea aat ttc gtg ccc teg get tet get ccc ggg tet gac act 
Ser Gly Ser Asn Phe Val Pro Ser Ala Ser Ala Pro Gly Ser Asp Thr 
210 215 220 225 

gtc age gac gtg gaa ctt gaa ctg aag aag ggt geg gtc att gtc aaa 
Val Ser Asp Val Glu Leu Glu Leu Lys Lys Gly Ala Val He Val Lys 
230 235 240 

gaa get eca aac eca aag get ctt teg ccg ccc gca gca ccc get gta 
Glu Ala Pro Asn Pro Lys Ala Leu Ser Pro Pro Ala Ala Pro Ala Val 
245 250 255 

caa caa gac ctt tgg gac ttc aag aaa tac att ggt ttc gag gag ccc 994 
Gin Gin Asp Leu Trp Asp Phe Lys Lys Tyr He Gly Phe Glu Glu Pro 
260 265 270 

gtg gag gcc aag gat gat ggc egg get gtt gca gat gat gcg ggc tec 1042 
Val Glu Ala Lys Asp Asp Gly Arg Ala Val Ala Asp Asp Ala Gly Ser 
275 280 285 



802 



850 



898 



946 



1090 



gtc atg aac gtg gtc gtc gtg get get gaa tgt tet ccc tgg tgc aaa 1138 
Val Met Asn Val Val Val Val Ala Ala Glu Cys Ser Pro Trp Cys Lys 
310 315 320 

aca ggt ggt ctt gga gat gtt gcc ggt get ttg ccc aag get ttg gcg 1186 
Thr Gly Gly Leu Gly Asp Val Ala Gly Ala Leu Pro Lys Ala Leu Ala 

325 330 335 

aag aga gga cat cgt gtt atg gtt gtg gta eca agg tat ggg gac tat 1234 
Lys Arg Gly His Arg Val Met Val Val Val Pro Arg Tyr Gly Asp Tyr 
340 345 350 



1282 



cag gat atg gaa gtg aat tat ttc cat get tat ate gat gga gtt gat 1330 
Gin Asp Met Glu Val Asn Tyr Phe His Ala Tyr He Asp Gly Val Asp 
370 375 380 385 
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ttt gtg ttc att gac get cct etc ttc cga cac cgc eag gaa gac att 
Phe Val Phe lie Asp Ala Pro Leu Phe Arg His Arg Gin Glu Asp lie 
390 395 400 



1378 



tat ggg ggc age aga eag gaa att atg aag cgc atg att ttg tte tgc 
Tyr Gly Gly Ser Arg Gin Glu lie Met Lys Arg Met lie Leu Phe Cys 
405 410 415 



1426 



aag gee get gte gag gtt cea tgg eae gtt cea tge ggc ggt gtc cct 
Lys Ala Ala Val Glu Val Pro Trp His Val Pro Cys Gly Gly Val Pro 
420 425 430 



1474 



tat ggg gat gga aat ctg gtg ttt att gca aat gat tgg cac aeg gca 
Tyr Gly Asp Gly Asn Leu Val Phe He Ala Asn Asp Trp His Thr Ala 
435 440 445 



1522 



etc ctg cct gtc tat ctg aaa gca tat tae agg gac cat ggt ttg atg 
Leu Leu Pro Val Tyr Leu Lys Ala Tyr Tyr Arg Asp His Gly Leu Met 
450 455 460 465 



1570 



eag tae act egg tec att atg gtg ata eat aac ate get eae eag gge 
Gin Tyr Thr Arg Ser He Met Val He His Asn He Ala His Gin Gly 
470 475 480 



1618 



cgt gge cea gta gat gag tte eeg tte aee gag ttg cct gag eae tae 
Arg Gly Pro Val Asp Glu Phe Pro Phe Thr Glu Leu Piro Glu His Tyr 
485 490 495 



1666 



ctg gaa eae ttc aga ctg tac gac ccc gtg ggt ggt gaa cac gcc aac 
Leu Glu His Phe Arg Leu Tyr Asp Pro Val Gly Gly Glu His Ala Asn 
500 505 510 



1714 



tac ttc gcc gcc ggc ctg aag atg gcg gac eag gtt gte gte gtg age 
Tyr Phe Ala Ala Gly Leu Lys Met Ala Asp Gin Val Val Val Val Ser 
515 520 525 



1762 



ccg ggg tae ctg tgg gag ctg aag acg gtg gag ggc ggc tgg ggg ett 
Pro Gly Tyr Leu Trp Glu Leu Lys Thr Val Glu Gly Gly Trp Gly Leu 
530 535 540 545 



1810 



cac gac ate ata egg eag aac gac tgg aag acc cgc ggc ate gtg aac 
His Asp He He Arg Gin Asn Asp Trp Lys Thr Arg Gly He Val Asn 
550 555 560 



1858 



ggc ate gac aac atg gag tgg aac ccc gag gtg gac gtc cac etc aag 
Gly He Asp Asn Met Glu Trp Asn Pro Glu Val Asp Val His Leu Lys 
565 570 575 



1906 



teg gac gge tac acc aac ttc tec ctg ggg acg ctg gac tee gge aag 
Ser Asp Gly Tyr Thr Asn Phe Ser Leu Gly Thr Leu Asp Ser Gly Lys 
580 585 590 



1954 



egg cag tge aag gag gcc ctg eag egg gag ctg gge ctg eag gtc cgc 2002 
Arg Gin Cys Lys Glu Ala Leu Gin Arg Glu Leu Gly Leu Gin Val Arg 
595 600 605 

gge gac gtg ccg ctg etc ggc tte ate ggg cgc ctg gac ggg eag aag 2050 
Gly Asp Val Pro Leu Leu Gly Phe He Gly Arg Leu Asp Gly Gin Lys 
610 615 620 625 

9gc gtg gag ate ate gcg gac gcg atg ccc tgg ate gtg age cag gac 2098 
Gly Val Glu He He Ala Asp Ala Met Pro Trp He Val Ser Gin Asp 
630 635 640 

gtg cag ctg gtc atg ctg ggc acc ggg cgc cac gac ctg gag ggc atg 2146 
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Val Gin Leu Val Met Leu Gly Thr Gly Arg His Asp Leu Glu Gly Met 
645 650 655 

ctg egg cac ttc gag egg gag cac cac gac aag gtg cgc ggg tgg gtg 2194 
Leu Arg His Phe Glu Arg Glu His His Asp Lys Val Arg Gly Trp Val 
660 665 670 

ggg ttc tec gtg egg ctg gcg cac egg ate acg gee ggc gee gac gcg 2242 
Gly Phe Ser Val Arg Leu Ala His Arg lie Thr Ala Gly Ala Asp Ala 
675 680 685 

etc etc atg cec tec egg ttc gag ccg tgc gga ctg aac eag etc tac 2290 
Leu Leu Met Pro Ser Arg Phe Glu Pro Cys Gly Leu Asn Gin Leu Tyr 
690 695 700 705 

gee atg gee tac ggc ace gtc cec gtc gtg cat gcc gtc ggt ggc ctg 233 8 
Ala Met Ala Tyr Gly Thr Val Pro Val Val His Ala Val Gly Gly Leu 
710 715 720 

^99 ace gtg ccg ccg ttc gac cec ttc aae cac tec ggg etc ggg 23 86 

Arg Asp Thr Val Pro Pro Phe Asp Pro Phe Asn His Ser Gly Leu Gly 
725 730 735 

tgg aeg ttc gac cgc gea gag gcg eag aag ctg ate gag gcg etc ggg 2434 
Trp Thr Phe Asp Arg Ala Glu Ala Gin Lys Leu lie Glu Ala Leu Gly 
740 745 750 

eae tgc etc cgc ace tac egg gac tac aag gag age tgg agg ggg etc 2482 
His Cys Leu Arg Thr Tyr Arg Asp Tyr Lys Glu Ser Trp Arg Gly Leu 
755 760 765 

cag gag cgc ggc atg teg cag gac ttc age tgg gag eat gee gee aag 2530 
Gin Glu Arg Gly Met Ser Gin Asp Phe Ser Trp Glu His Ala Ala Lys 
770 775 780 785 

etc tac gag gac gtc etc gtc aag gcc aag tac cag tgg tga 2572 
Leu Tyr Glu Asp Val Leu Val Lys Ala Lys Tyr Gin Trp 





790 




795 








acgetagctg 


ctagecggte 


cagececgea 


tgegtgcatg 


acaggatgga 


attgcgeatt 


2632 


gcgeacgeag 


gaaggtgeea 


tggagegeeg 


geatccgega 


agtaeagtga 


eatgaggtgt 


2692 


gtgtggttga 


gacgetgatt 


ecgatetggt 


ccgtagcaga 


gtagagcgga 


ggtagggaag 


2752 


cgctccttgt 


tacaggtata 


tgggaatgtt 


gttaacttgg 


tattgtaatt 


tgttatgttg 


2812 


tgtgcattat 


tacagagggc 


aacgatctgc 


gccggcgeac 


cggeecaact 


9ttgggccgg 


2872 


tegeacagea 


gccgttggat 


cegacegcet 


gggecgttgg 


atcccaccga 


aaaaaaaaaa 


2932 


aaaaaaa 












2939 



<210> 2 
<211> 798 
<212> PRT 

<213> Triticum aestivum 
<400> 2 

Met Ser Ser Ala Val Ala Ser Ala Ala Ser Phe Leu Ala Leu Ala Ser 
15 10 15 



Ala Ser Pro Gly Arg Ser Arg Arg Arg Thr Arg Val Ser Ala Ser Pro 
20 25 30 
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Pro His Thr Gly Ala Gly Arg Leu His Trp Pro Pro Ser Pro Pro Gin 
35 40 45 

Arg Thr Ala Arg Asp Gly Ala Val Ala Ala Arg Ala Ala Gly Lys Lys 
50 55 60 

Asp Ala Gly lie Asp Asp Ala Ala Pro Ala Arg Gin Pro Arg Ala Leu 
65 70 75 80 

Arg Gly Gly Ala Ala Thr Lys Val Ala Glu Arg Arg Asp Pro Val Lys 
85 90 95 

Thr Leu Asp Arg Asp Ala Ala Glu Gly Gly Ala Pro Ser Pro Pro Ala 
100 105 110 

Pro Arg Gin Glu Asp Ala Arg Leu Pro Ser Met Asn Gly Met Pro Val 
115 120 125 

Asn Gly Glu Asn Lys Ser Thr Gly Gly Gly Gly Ala Thr Lys Asp Ser 
130 135 140 

Gly Leu Pro Ala Pro Ala Arg Ala Pro Gin Pro Ser Ser Gin Asn Arg 
145 150 155 160 

Val Pro Val Asn Gly Glu Asn Lys Ala Asn Val Ala Ser Pro Pro Thr 
165 170 175 

Ser lie Ala Glu Val Ala Ala Pro Asp Pro Ala Ala Thr lie Ser lie 
180 185 190 

Ser Asp Lys Ala Pro Glu Ser Val Val Pro Ala Glu Lys Ala Pro Pro 
195 200 205 

Ser Ser Gly Ser Asn Phe Val Pro Ser Ala Ser Ala Pro Gly Ser Asp 
210 215 220 

Thr Val Ser Asp Val Glu Leu Glu Leu Lys Lys Gly Ala Val lie Val 
225 230 235 240 

Lys Glu Ala Pro Asn Pro Lys Ala Leu Ser Pro Pro Ala Ala Pro Ala 
245 250 255 

Val Gin Gin Asp Leu Trp Asp Phe Lys Lys Tyr lie Gly Phe Glu Glu 
260 265 270 

Pro Val Glu Ala Lys Asp Asp Gly Arg Ala Val Ala Asp Asp Ala Gly 
275 280 285 

Ser Phe Glu His His Gin Asn His Asp Ser Gly Pro Leu Ala Gly Glu 

290 295 300 

Asn Val Met Asn Val Val Val Val Ala Ala Glu Cys Ser Pro Trp Cys 
305 310 315 320 

Lys Thr Gly Gly Leu Gly Asp Val Ala Gly Ala Leu Pro Lys Ala Leu 
325 330 335 

Ala Lys Arg Gly His Arg Val Met Val Val Val Pro Arg Tyr Gly Asp 
340 345 350 

Tyr Glu Glu Ala Tyr Asp Val Gly Val Arg Lys Tyr Tyr Lys Ala Ala 
355 360 365 

Gly Gin Asp Met Glu Val Asn Tyr Phe His Ala Tyr He Asp Gly Val 
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370 375 380 

Asp Phe Val Phe lie Asp Ala Pro Leu Phe Arg His Arg Gin Glu Asp 
385 390 395 400 

lie Tyr Gly Gly Ser Arg Gin Glu lie Met Lys Arg Met lie Leu Phe 
405 410 415 

Cys Lys Ala Ala Val Glu Val Pro Trp His Val Pro Cys Gly Gly Val 
420 425 430 

Pro Tyr Gly Asp Gly Asn Leu Val Phe He Ala Asn Asp Trp His Thr 
435 440 445 

Ala Leu Leu Pro Val Tyr Leu Lys Ala Tyr Tyr Arg Asp His Gly Leu 
450 455 460 

Met Gin Tyr Thr Arg Ser He Met Val He His Asn He Ala His Gin 
465 470 475 480 

Gly Arg Gly Pro Val Asp Glu Phe Pro Phe Thr Glu Leu Pro Glu His 
485 490 495 

Tyr Leu Glu His Phe Arg Leu Tyr Asp Pro Val Gly Gly Glu His Ala 
500 505 510 

Asn Tyr Phe Ala Ala Gly Leu Lys Met Ala Asp Gin Val Val Val Val 
515 520 525 

Ser Pro Gly Tyr Leu Trp Glu Leu Lys Thr Val Glu Gly Gly Trp Gly 
530 535 540 

Leu His Asp He He Arg Gin Asn Asp Trp Lys Thr Arg Gly He Val 
545 550 555 560 

Asn Gly He Asp Asn Met Glu Trp Asn Pro Glu Val Asp Val His Leu 
565 570 575 

Lys Ser Asp Gly Tyr Thr Asn Phe Ser Leu Gly Thr Leu Asp Ser Gly 
580 585 590 

Lys Arg Gin Cys Lys Glu Ala Leu Gin Arg Glu Leu Gly Leu Gin Val 
595 600 605 

Arg Gly Asp Val Pro Leu Leu Gly Phe He Gly Arg Leu Asp Gly Gin 
610 615 620 

Lys Gly Val Glu He He Ala Asp Ala Met Pro Trp He Val Ser Gin 
625 630 635 640 

Asp Val Gin Leu Val Met Leu Gly Thr Gly Arg His Asp Leu Glu Gly 
645 650 655 

Met Leu Arg His Phe Glu Arg Glu His His Asp Lys Val Arg Gly Trp 
660 665 670 

Val Gly Phe Ser Val Arg Leu Ala His Arg He Thr Ala Gly Ala Asp 
675 680 685 

Ala Leu Leu Met Pro Ser Arg Phe Glu Pro Cys Gly Leu Asn Gin Leu 
690 695 700 

Tyr Ala Met Ala Tyr Gly Thr Val Pro Val Val His Ala Val Gly Gly 
705 710 715 720 
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Leu Arg Asp Thr Val Pro Pro Phe Asp Pro Phe Asn His Ser Gly Leu 
725 730 735 

Gly Trp Thr Phe Asp Arg Ala Glu Ala Gin Lys Leu lie Glu Ala Leu 

740 745 750 

Gly His Cys Leu Arg Thr Tyr Arg Asp Tyr Lys Glu Ser Trp Arg Gly 
755 760 765 

Leu Gin Glu Arg Gly Met Ser Gin Asp Phe Ser Trp Glu His Ala Ala 
770 775 780 

Lys Leu Tyr Glu Asp Val Leu Val Lys Ala Lys Tyr Gin Trp 
785 790 795 



<210> 3 
<211> 2800 
<212> DNA 

<213> Triticum aestivum 

<220> 

<221> CDS 

<222> (89) . , (2488) 

<400> 3 

gctgccacca cctccgcctg cgccgcgctc tgggcggagg accaacccgc gcatcgtacc 60 

atcgcccgcc ccgatcccgg ccgccgcc atg teg teg gcg gtc gcg tec gee 112 

Met Ser Ser Ala Val Ala Ser Ala 
1 5 

gcg tec ttc etc gcg etc gee tec gee tee ccc ggg aga tea cgc agg 160 
Ala Ser Phe Leu Ala Leu Ala Ser Ala Ser Pro Gly Arg Ser Arg Arg 
10 15 20 

egg gcg agg gtg age gcg ccg cca ccc cac gcc ggg gcc ggc agg ctg 208 
Arg Ala Arg Val Ser Ala Pro Pro Pro His Ala Gly Ala Gly Arg Leu 
25 30 35 40 

cac tgg ccg ccg tgg ccg ccg cag cgc acg get cgc gac gga ggt gtg 256 
His Trp Pro Pro Trp Pro Pro Gin Arg Thr Ala Arg Asp Gly Gly Val 
45 50 55 

gee gcg cgc gee gcc ggg aag aag gac gcg agg gtc gac gac gac gcc 304 
Ala Ala Arg Ala Ala Gly Lys Lys Asp Ala Arg Val Asp Asp Asp Ala 
60 65 70 

gcg tec gcg agg cag ccc cgc gca cgc cgc ggt ggc gcc gcc ace aag 352 
Ala Ser Ala Arg Gin Pro Arg Ala Arg Arg Gly Gly Ala Ala Thr Lys 
75 80 85 

gtc gcg gag egg agg gat ccc gtc aag acg etc gat cgc gac gcc gcg 400 
Val Ala Glu Arg Arg Asp Pro Val Lys Thr Leu Asp Arg Asp Ala Ala 
90 95 100 

gaa ggt ggc gcg ccg gca ccg ccg gca ccg agg cag gac gcc gcc cgt 448 
Glu Gly Gly Ala Pro Ala Pro Pro Ala Pro Arg Gin Asp Ala Ala Arg 
105 110 115 120 

cca ccg agt atg aac ggc acg ccg gtg aac ggt gag aac aaa tct acc 4 96 
Pro Pro Ser Met Asn Gly Thr Pro Val Asn Gly Glu Asn Lys Ser Thr 
125 130 135 



ggc ggc ggc ggc gcg acc aaa gac age ggg ctg ccc gca ccc gca cgc 



544 
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Gly Gly Gly Gly Ala Thr Lys Asp Ser Gly Leu Pro Ala Pro Ala Arg 
140 145 150 

gcg ccc cat ccg teg acc cag aac aga gta cca gtg aac ggt gaa aac 592 
Ala Pro His Pro Ser Thr Gin Asn Arg Val Pro Val Asn Gly Glu Asn 
155 160 165 

aaa get aae gtc gee teg ccg eeg acg age ata gcc gag gtc gtg get 64 0 
Lys Ala Asn Val Ala Ser Pro Pro Thr Ser He Ala Glu Val Val Ala 
170 175 180 

ccg gat tec gca get ace att tec ate agt gac aag gcg eeg gag tee 68 8 
Pro Asp Ser Ala Ala Thr He Ser He. Ser Asp Lys Ala Pro Glu Ser 
185 190 195 200 

gtt gtc cca gcc gag aag ccg ccg ccg teg tee ggc tea aat ttc gtg 736 
Val Val Pro Ala Glu Lys Pro Pro Pro Ser Ser Gly Ser Asn Phe Val 
205 210 215 



gtc teg get tct get ccc agg ctg gac att gac age gat gtt gaa cet 
Val Ser Ala Ser Ala Pro Arg Leu Asp He Asp Ser Asp Val Glu Pro 
220 225 230 

gaa ctg aag aag ggt gcg gtc ate gtc gaa gaa get cca aac cca aag 
Glu Leu Lys Lys Gly Ala Val He Val Glu Glu Ala Pro Asn Pro Lys 
235 240 245 

get ett teg ccg cet gea gcc ccc get gta caa gaa gac ctt tgg gac 
Ala Leu Ser Pro Pro Ala Ala Pro Ala Val Gin Glu Asp Leu Trp Asp 
250 255 260 



gtt gee ggt get ttg ccc aag get ttg gcg aag aga gga eat cgt gtt 
Val Ala Gly Ala Leu Pro Lys Ala Leu Ala Lys Arg Gly His Arg Val 
330 335 340 



784 



832 



880 



ttc aag aaa tae att gge ttc gag gag ccc gtg gag gcc aag gat gat 92 8 
Phe Lys Lys Tyr He Gly Phe Glu Glu Pro Val Glu Ala Lys Asp Asp 
265 270 275 280 

ggc tgg get gtt gea gat gat gcg gge tec ttt gaa cat cae cag aae 976 
Gly Trp Ala Val Ala Asp Asp Ala Gly Ser Phe Glu His His Gin Asn 
285 290 295 

eat gat tec gga cet ttg gea ggg gag aac gtc atg aac gtg gtc gtc 1024 
His Asp Ser Gly Pro Leu Ala Gly Glu Asn Val Met Asn Val Val Val 
300 305 310 

gtg get get gaa tgt tct ccc tgg tgc aaa aca ggt ggt ctt gga gat 1072 
Val Ala Ala Glu Cys Ser Pro Trp Cys Lys Thr Gly Gly Leu Gly Asp 
315 320 325 



1120 



atg gtt gtg gta cca agg tat ggg gac tat gag gaa gee tac gat gtc 1168 
Met Val Val Val Pro Arg Tyr Gly Asp Tyr Glu Glu Ala Tyr Asp Val 
345 350 355 360 

gga gtc cga aaa tac tac aag get get gga cag gat atg gaa gtg aat 1216 
Gly Val Arg Lys Tyr Tyr Lys Ala Ala Gly Gin Asp Met Glu Val Asn 
365 370 375 

tat ttc cat get tat ate gat gga gtt gat ttt gtg ttc att gac get 1264 
Tyr Phe His Ala Tyr He Asp Gly Val Asp Phe Val Phe He Asp Ala 
380 385 390 

cet etc ttc cga cae cgc cag gaa gac att tat ggg ggc age aga cag 1312 
Pro Leu Phe Arg His Arg Gin Glu Asp He Tyr Gly Gly Ser Arg Gin 
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395 400 405 

gaa att atg aag cgc atg att ttg ttc tgc aag gcc get gtc gag gtt 1360 
Glu lie Met Lys Arg Met lie Leu Phe Cys Lys Ala Ala Val Glu Val 
410 415 420 

cct tgg cac gtt cca tgc ggc ggt gtc cct tat ggg gat gga aat ctg 1408 
Pro Trp His Val Pro Cys Gly Gly Val Pro Tyr Gly Asp Gly Asn Leu 
425 430 435 440 

gtg ttt att gca aat gat tgg cac acg gca etc ctg cct gtc tat ctg 1456 
Val Phe lie Ala Asn Asp Trp His Thr Ala Leu Leu Pro Val Tyr Leu 
445 450 455 

aaa gca tat tac agg gac cat ggt ttg atg cag tac act egg tec att 1504 
Lys Ala Tyr Tyr Arg Asp His Gly Leu Met Gin Tyr Thr Arg Ser lie 
460 465 470 

^tg gtg ata cat aac ate gcg cac cag ggc cgt ggc cca gta gat gaa 1552 
Met Val lie His Asn lie Ala His Gin Gly Arg Gly Pro Val Asp Glu 
475 480 485 

ttc ecg ttc acc gag ttg cct gag cac tac ctg gaa cac ttc aga ctg 1600 
Phe Pro Phe Thr Glu Leu Pro Glu His Tyr Leu Glu His Phe Arg Leu 
490 495 500 

tac gac ccc gtg ggt ggt gag cac gcc aac tac ttc gcc gcc ggc ctg 1648 
Tyr Asp Pro Val Gly Gly Glu His Ala Asn Tyr Phe Ala Ala Gly Leu 
505 510 515 520 

aag atg gcg gac cag gtt gtc gtg gtg age ccc ggg tac ctg tgg gag 1696 
Lys Met Ala Asp Gin Val Val Val Val Ser Pro Gly Tyr Leu Trp Glu 
525 530 535 

etc aag acg gtg gag ggc ggc tgg ggg ett cac gac ate ata egg cag 1744 
Leu Lys Thr Val Glu Gly Gly Trp Gly Leu His Asp lie He Arg Gin 
540 545 550 

aac gac tgg aag acc cgc ggc ate gtc aac ggc ate gac aac atg gag 1792 
Asn Asp Trp Lys Thr Arg Gly He Val Asn Gly lie Asp Asn Met Glu 
555 560 565 

tgg aac ccc gag gtg gac gtc cac etc aag teg gac ggc tac ace aac 184 0 
Trp Asn Pro Glu Val Asp Val His Leu Lys Ser Asp Gly Tyr Thr Asn 
570 575 580 

ttc tec ctg ggg acg ctg gac tec ggc aag egg cag tgc aag gag gcc 1888 
Phe Ser Leu Gly Thr Leu Asp Ser Gly Lys Arg Gin Cys Lys Glu Ala 
585 590 595 600 

ctg cag cgc gag ctg ggc ctg cag gtc cgc gcc gac gtg ccg ctg etc 1936 
Leu Gin Arg Glu Leu Gly Leu Gin Val Arg Ala Asp Val Pro Leu Leu 
605 610 615 

ggc ttc ate ggc cgc ctg gac ggg cag aag ggc gtg gag ate ate gcg 1984 
Gly Phe He Gly Arg Leu Asp Gly Gin Lys Gly Val Glu He He Ala 
620 625 630 

gac gcc atg ccc tgg ate gtg age cag gac gtg cag ctg gtc atg ctg 2032 
Asp Ala Met Pro Trp He Val Ser Gin Asp Val Gin Leu Val Met Leu 
635 640 645 

ggc ace ggc cgc cac gac ctg gag age atg ctg egg cac ttc gag egg 2080 
Gly Thr Gly Arg His Asp Leu Glu Ser Met Leu Arg His Phe Glu Arg 
650 655 660 
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gag cac cac gac aag gtg cgc ggg tgg gtg ggg ttc tec gtg cgc ctg 2128 
Glu His His Asp Lys Val Arg Gly Trp Val Gly Phe Ser Val Arg Leu 
665 670 675 680 

gcg cac egg ate acg gcg ggc gcc gac gcg etc etc atg ccc tec egg 2176 
Ala His Arg lie Thr Ala Gly Ala Asp Ala Leu Leu Met Pro Ser Arg 
685 690 695 

ttc gag ccg tgc ggg ttg aac cag ctt tac gcc atg gcc tac ggc acc 2224 
Phe Glu Pro Cys Gly Leu Asn Gin Leu Tyr Ala Met Ala Tyr Gly Thr 
700 705 710 

gtc ccc gtc gtg cac gcc gtc ggc ggg gtg agg gac acc gtg ccg ccg 2272 
Val Pro Val Val His Ala Val Gly Gly Val Arg Asp Thr Val Pro Pro 
715 720 725 

ttc gac ccc ttc aac cac tec ggc etc ggg tgg acg ttc gac cgc gcc 2320 
Phe Asp Pro Phe Asn His Ser Gly Leu Gly Trp Thr Phe Asp Arg Ala 
730 735 740 

gag gcg cac aag ctg ate gag gcg etc ggg cac tgc etc cgc acc tac 2368 
Glu Ala His Lys Leu lie Glu Ala Leu Gly His Cys Leu Arg Thr Tyr 
745 750 755 760 

egg gac tac aag gag age tgg agg ggc etc cag gag cgc ggc atg teg 2416 
Arg Asp Tyr Lys Glu Ser Trp Arg Gly Leu Gin Glu Arg Gly Met Ser 
765 770 775 

cag gac ttc age tgg gag cat gcc gcc aag etc tac gag gac gtc etc 2464 
Gin Asp Phe Ser Trp Glu His Ala Ala Lys Leu Tyr Glu Asp Val Leu 
780 785 790 

etc aag gee aag tac cag tgg tga aegetagctg etagcegctc cagececgca 2518 
Leu Lys Ala Lys Tyr Gin Trp 

795 800 

tgcgtgcatg eatgagaggg tggaactgeg cattgegcec gcaggaacgt gecatcctte 2578 

tcgatgggag egceggcatc cgcgaggtge agtgacatga gaggtgtgtg tggttgagae 2638 

getgattccg atetcgatet ggtccgtage agagtagage ggacgtaggg aagcgctcct 2698 

tgttgcaggt atatgggaat gttgtcaact tggtattgta gtttgctatg ttgtatgcgt 2758 

tattacaatg ttgttaetta ttcttgttaa aaaaaaaaaa aa 2800 



<210> 4 
<211> 799 
<212> PRT 

<213> Tritieum aestivum 
<400> 4 

Met Ser Ser Ala Val Ala Ser Ala Ala Ser Phe Leu Ala Leu Ala Ser 
15 10 15 

Ala Ser Pro Gly Arg Ser Arg Arg Arg Ala Arg Val Ser Ala Pro Pro 
20 25 30 

Pro His Ala Gly Ala Gly Arg Leu His Trp Pro Pro Trp Pro Pro Gin 
35 40 45 



Arg Thr Ala Arg Asp Gly Gly Val Ala Ala Arg Ala Ala Gly Lys Lys 
50 55 60 
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Asp Ala Arg Val Asp Asp Asp Ala Ala Ser Ala Arg Gin Pro Arg Ala 
65 70 75 80 

Arg Arg Gly Gly Ala Ala Thr Lys Val Ala Glu Arg Arg Asp Pro Val 
85 90 95 

hys Thr Leu Asp Arg Asp Ala Ala Glu Gly Gly Ala Pro Ala Pro Pro 

100 105 110 

Ala Pro Arg Gin Asp Ala Ala Arg Pro Pro Ser Met Asn Gly Thr Pro 
115 120 125 

Val Asn Gly Glu Asn Lys Ser Thr Gly Gly Gly Gly Ala Thr Lys Asp 
130 135 140 

Ser Gly Leu Pro Ala Pro Ala Arg Ala Pro His Pro Ser Thr Gin Asn 
145 150 155 160 

Arg Val Pro Val Asn Gly Glu Asn Lys Ala Asn Val Ala Ser Pro Pro 
165 170 175 

Thr Ser lie Ala Glu Val Val Ala Pro Asp Ser Ala Ala Thr He Ser 
180 185 190 

He Ser Asp Lys Ala Pro Glu Ser Val Val Pro Ala Glu Lys Pro Pro 
195 200 205 

Pro Ser Ser Gly Ser Asn Phe Val Val Ser Ala Ser Ala Pro Arg Leu 

210 215 220 

Asp He Asp Ser Asp Val Glu Pro Glu Leu Lys Lys Gly Ala Val He 
225 230 235 240 

Val Glu Glu Ala Pro Asn Pro Lys Ala Leu Ser Pro Pro Ala Ala Pro 

245 250 255 

Ala Val Gin Glu Asp Leu Trp Asp Phe Lys Lys Tyr He Gly Phe Glu 
260 265 270 

Glu Pro Val Glu Ala Lys Asp Asp Gly Trp Ala Val Ala Asp Asp Ala 
275 280 285 

Gly Ser Phe Glu His His Gin Asn His Asp Ser Gly Pro Leu Ala Gly 
290 295 300 

Glu Asn Val Met Asn Val Val Val Val Ala Ala Glu Cys Ser Pro Trp 
305 310 315 320 

Cys Lys Thr Gly Gly Leu Gly Asp Val Ala Gly Ala Leu Pro Lys Ala 
325 330 335 

Leu Ala Lys Arg Gly His Arg Val Met Val Val Val Pro Arg Tyr Gly 
340 345 350 

Asp Tyr Glu Glu Ala Tyr Asp Val Gly Val Arg Lys Tyr Tyr Lys Ala 

355 360 365 

Ala Gly Gin Asp Met Glu Val Asn Tyr Phe His Ala Tyr He Asp Gly 
370 375 380 

Val Asp Phe Val Phe He Asp Ala Pro Leu Phe Arg His Arg Gin Glu 
385 390 395 400 



Asp He Tyr Gly Gly Ser Arg Gin Glu He Met Lys Arg Met He Leu 
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405 410 415 

Phe Cys Lys Ala Ala Val Glu Val Pro Trp His Val Pro Cys Gly Gly 
420 425 430 

Val Pro Tyr Gly Asp Gly Asn Leu Val Phe He Ala Asn Asp Trp His 
435 440 445 

Thr Ala Leu Leu Pro Val Tyr Leu Lys Ala Tyr Tyr Arg Asp His Gly 
450 455 460 

Leu Met Gin Tyr Thr Arg Ser He Met Val He His Asn He Ala His 
465 470 475 480 

Gin Gly Arg Gly Pro Val Asp Glu Phe Pro Phe Thr Glu Leu Pro Glu 
485 490 495 

His Tyr Leu Glu His Phe Arg Leu Tyr Asp Pro Val Gly Gly Glu His 
500 505 510 

Ala Asn Tyr Phe Ala Ala Gly Leu Lys Met Ala Asp Gin Val Val Val 
515 520 525 

Val Ser Pro Gly Tyr Leu Trp Glu Leu Lys Thr Val Glu Gly Gly Trp 
530 535 540 

Gly Leu His Asp He He Arg Gin Asn Asp Trp Lys Thr Arg Gly He 
545 550 555 560 

Val Asn Gly He Asp Asn Met Glu Trp Asn Pro Glu Val Asp Val His 

565 570 575 

Leu Lys Ser Asp Gly Tyr Thr Asn Phe Ser Leu Gly Thr Leu Asp Ser 
580 585 590 

Gly Lys Arg Gin Cys Lys Glu Ala Leu Gin Arg Glu Leu Gly Leu Gin 
595 600 605 

Val Arg Ala Asp Val Pro Leu Leu Gly Phe He Gly Arg Leu Asp Gly 
610 615 620 

Gin Lys Gly Val Glu He He Ala Asp Ala Met Pro Trp He Val Ser 
625 630 635 640 

Gin Asp Val Gin Leu Val Met Leu Gly Thr Gly Arg His Asp Leu Glu 

645 650 655 

Ser Met Leu Arg His Phe Glu Arg Glu His His Asp Lys Val Arg Gly 
660 665 670 

Trp Val Gly Phe Ser Val Arg Leu Ala His Arg He Thr Ala Gly Ala 
675 680 685 

Asp Ala Leu Leu Met Pro Ser Arg Phe Glu Pro Cys Gly Leu Asn Gin 
690 695 700 

Leu Tyr Ala Met Ala Tyr Gly Thr Val Pro Val Val His Ala Val Gly 
705 710 715 720 

Gly Val Arg Asp Thr Val Pro Pro Phe Asp Pro Phe Asn His Ser Gly 
725 730 735 



Leu Gly Trp Thr Phe Asp Arg Ala Glu Ala His Lys Leu He Glu Ala 
740 745 750 
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Leu Gly His Cys Leu Arg Thr Tyr Arg Asp Tyr Lys Glu Ser Trp Arg 
755 760 765 

Gly Leu Gin Glu Arg Gly Met Ser Gin Asp Phe Ser Trp Glu His Ala 

770 775 780 



Ala Lys Leu Tyr Glu Asp Val Leu Leu Lys Ala Lys Tyr Gin Trp 
785 790 795 



<210> 5 
<211> 2107 
<212> DNA 

<213> Triticum aestivum 

<220> 

<221> CDS 

<222> (1) . . <1794) 

<400> 5 

cca get gag aag acg ccg ccg teg tec ggc tea aat ttc gag tec teg 48 
Pro Ala Glu Lys Thr Pro Pro Ser Ser Gly Ser Asn Phe Glu Ser Ser 
15 10 15 

gcc tct get ccc ggg tct gac act gtc age gac gtg gaa caa gaa ctg 96 
Ala Ser Ala Pro Gly Ser Asp Thr Val Ser Asp Val Glu Gin Glu Leu 
20 25 30 

aag aag ggt gcg gtc gtt gtc gaa gaa get cca aag cca aag get ett 144 
Lys Lys Gly Ala Val Val Val Glu Glu Ala Pro Lys Pro Lys Ala Leu 
35 40 45 

teg ccg cct gca gcc ccc get gta caa gaa gac ctt tgg gat ttc aag 192 
Ser Pro Pro Ala Ala Pro Ala Val Gin Glu Asp Leu Trp Asp Phe Lys 
50 55 60 

aaa tac att ggt ttc gag gag ccc gtg gag gcc aag gat gat ggc egg 240 
Lys Tyr He Gly Phe Glu Glu Pro Val Glu Ala Lys Asp Asp Gly Arg 
65 70 75 80 

get gtc gea gat gat gcg ggc tec ttt gaa cac cac cag aat cac gac 288 
Ala Val Ala Asp Asp Ala Gly Ser Phe Glu His His Gin Asn His Asp 
85 90 95 

tec gga cct ttg gca ggg gag aat gtc atg aac gtg gtc gtc gtg get 336 
Ser Gly Pro Leu Ala Gly Glu Asn Val Met Asn Val Val Val Val Ala 
100 105 110 

get gag tgt tct eee tgg tgc aaa aca ggt ggt ctg gga gat gtt gcg 3 84 
Ala Glu Cys Ser Pro Trp Cys Lys Thr Gly Gly Leu Gly Asp Val Ala 

115 120 125 

ggt get ctg eee aag get ttg gea aag aga gga cat egt gtt atg gtt 432 
Gly Ala Leu Pro Lys Ala Leu Ala Lys Arg Gly His Arg Val Met Val 
130 135 140 

gtg gta cca agg tat ggg gac tat gaa gaa cct acg gat gtc gga gtc 4 80 
Val Val Pro Arg Tyr Gly Asp Tyr Glu Glu Pro Thr Asp Val Gly Val 
145 150 155 160 

cga aaa tac tac aag get get gga cag gat atg gaa gtg aat tat ttc 528 
Arg Lys Tyr Tyr Lys Ala Ala Gly Gin Asp Met Glu Val Asn Tyr Phe 
165 170 175 



cat get tat ate gat gga gtt gat ttt gtg ttc att gac get cct etc 576 
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His Ala Tyr He Asp Gly Val Asp Phe Val Phe He Asp Ala Pro Leu 

180 185 190 

ttc cga cac cga gag gaa gac att tat ggg ggc age aga cag gaa att 

Phe Arg His Arg Glu Glu Asp He Tyr Gly Gly Ser Arg Gin Glu He 

195 200 205 

atg aag cgc atg att ttg ttc tgc aag gcc get gtt gag gtt cca tgg 

Met Lys Arg Met He Leu Phe Cys Lys Ala Ala Val Glu Val Pro Trp 

210 215 220 

cac gtt cca tgc ggc ggt gtc cct tat ggg gat gga aat ctg gtg ttt 

His Val Pro Cys Gly Gly Val Pro Tyr Gly Asp Gly Asn Leu Val Phe 

225 230 235 240 

att gca aat gat tgg cac acg gca etc ctg cct gtc tat ctg aaa gca 

He Ala Asn Asp Trp His Thr Ala Leu Leu Pro Val Tyr Leu Lys Ala 

245 250 255 

tat tac agg gac cat ggt ttg atg cag tac act egg tec att atg gtg 

Tyr Tyr Arg Asp His Gly Leu Met Gin Tyr Thr Arg Ser He Met Val 

260 265 270 



624 



672 



720 



768 



816 



ata cat aac ate get cac cag ggc cgt ggc cct gta gat gaa ttc ceg 
He His Asn He Ala His Gin Gly Arg Gly Pro Val Asp Glu Phe Pro 
275 280 285 

ttc acc gag ttg cct gag cac tac ctg gaa cac ttc aga ctg tac gac 
Phe Thr Glu Leu Pro Glu His Tyr Leu Glu His Phe Arg Leu Tyr Asp 
290 295 300 

ccc gtg ggt ggt gaa cac gcc aac tac ttc gcc gcc' ggc ctg aag atg 
Pro Val Gly Gly Glu His Ala Asn Tyr Phe Ala Ala Gly Leu Lys Met 
305 310 315 320 

gcg gac cag gtt gtc gtg gtg age ccc ggg tac ctg tgg gag ctg aag 
Ala Asp Gin Val Val Val Val Ser Pro Gly Tyr Leu Trp Glu Leu Lys 
325 330 335 

acg gtg gag ggc ggc tgg ggg ctt cac gac ate ata egg cag aac gac 
Thr Val Glu Gly Gly Trp Gly Leu His Asp He He Arg Gin Asn Asp 
340 345 350 



864 



912 



960 



1008 



1056 



tgg aag acc cgc ggc ate gtc aac ggc ate gac aac atg gag tgg aac 
Trp Lys Thr Arg Gly He Val Asn Gly He Asp Asn Met Glu Trp Asn 
355 360 365 

ccc gag gtg gac gcc cac etc aag teg gac ggc tac acc aac ttc tee 
Pro Glu Val Asp Ala His Leu Lys Ser Asp Gly Tyr Thr Asn Phe Ser 

370 375 380 



1104 



1152 



ctg agg acg ctg gac tec ggc aag egg cag tgc aag gag gcc ctg cag 12 00 
Leu Arg Thr Leu Asp Ser Gly Lys Arg Gin Cys Lys Glu Ala Leu Gin 
385 390 395 400 

cgc gag ctg ggc ctg cag gtc cgc gcc gac gtg ceg ctg etc ggc ttc 124 8 
Arg Glu Leu Gly Leu Gin Val Arg Ala Asp Val Pro Leu Leu Gly Phe 
405 410 415 



ate ggc cgc ctg gac ggg cag aag ggc gtg gag ate ate gcg gac gee 
He Gly Arg Leu Asp Gly Gin Lys Gly Val Glu He He Ala Asp Ala 
420 425 430 



1296 



atg ccc tgg ate gtg age cag gac gtg cag ctg gtg atg ctg ggc acc 
Met Pro Trp He Val Ser Gin Asp Val Gin Leu Val Met Leu Gly Thr 



1344 
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435 440 445 

999 cgc cac gac ctg gag age atg ctg cag cac ttc gag egg gag cac 1392 
Gly Arg His Asp Leu Glu Ser Met Leu Gin His Phe Glu Arg Glu His 
450 455 460 

cac gac aag gtg cgc ggg tgg gtg ggg ttc tec gtg cgc ctg gcg cac 1440 
His Asp Lys Val Arg Gly Trp Val Gly Phe Ser Val Arg Leu Ala His 
465 470 475 480 

egg ate acg gcg ggg gcg gac gcg etc etc atg ccc tec egg ttc gtg 14 88 
Arg lie Thr Ala Gly Ala Asp Ala Leu Leu Met Pro Ser Arg Phe Val 
485 490 495 

ccg tgc ggg ctg aac cag etc tac gee atg gee tac ggc acc gtc ccc 1536 
Pro Cys Gly Leu Asn Gin Leu Tyr Ala Met Ala Tyr Gly Thr Val Pro 
500 505 510 

gtc gtg cac gee gtc ggc ggc etc agg gac acc gtg ccg ccg ttc gac 1584 
Val Val His Ala Val Gly Gly Leu Arg Asp Thr Val Pro Pro Phe Asp 
515 520 525 

ccc ttc aac cac tec ggg etc ggg tgg acg ttc gac cgc gee gag gcg 1632 
Pro Phe Asn His Ser Gly Leu Gly Trp Thr Phe Asp Arg Ala Glu Ala 
530 535 540 

cac aag ctg ate gag gcg etc ggg cac tgc etc cgc ace tac ega gac 1680 
His Lys Leu lie Glu Ala Leu Gly His Cys Leu Arg Thr Tyr Arg Asp 
545 550 555 560 

ttc aag gag age tgg agg gee etc cag gag cgc ggc atg teg cag gac 1728 
Phe Lys Glu Ser Trp Arg Ala Leu Gin Glu Arg Gly Met Ser Gin Asp 
565 570 575 

ttc age tgg gag cac gcc gee aag etc tac gag gae gtc etc gtc aag 1776 
Phe Ser Tarp Glu His Ala Ala Lys Leu Tyr Glu Asp Val Leu Val Lys 
580 585 590 

gcc aag tac cag tgg tga acgetagctg ctagccgctc cagccccgca 1824 
Ala Lys Tyr Gin Trp 
595 

tgcgtgcatg aeaggatgga aetgcattge geacgeagga aagtgccatg gagcgccggc 1884 
atecgegaag tacagtgaca tgaggtgtgt gtggttgaga cgetgattcc aatccggccc 1944 
gtagcagagt agagcggagg tatatgggaa tettaacttg gtattgtaat ttgttatgtt 2004 
gtgtgcatta ttacaatgtt gttacttatt cttgttaagt cggaggceaa gggegaaagc 2064 
tageteacat gtetgatgga tgcaaaaaaa aaaaaaaaaa aaa 2107 



<210> 6 
<211> 597 
<212> PRT 

<213> Triticum aestivum 
<400> 6 

Pro Ala Glu Lys Thr Pro Pro Ser Ser Gly Ser Asn Phe Glu Ser Ser 
15 10 15 



Ala Ser Ala Pro Gly Ser Asp Thr Val Ser Asp Val Glu Gin Glu Leu 
20 25 30 
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Lys Lys Gly Ala Val Val Val Glu Glu Ala Pro Lys Pro Lys Ala Leu 
35 40 45 

Ser Pro Pro Ala Ala Pro Ala Val Gin Glu Asp Leu Trp Asp Phe Lys 
50 55 60 

Lys Tyr He Gly Phe Glu Glu Pro Val Glu Ala Lys Asp Asp Gly Arg 
65 70 75 80 

Ala Val Ala Asp Asp Ala Gly Ser Phe Glu His His Gin Asn His Asp 
85 90 95 

Ser Gly Pro Leu Ala Gly Glu Asn Val Met Asn Val Val Val Val Ala 
100 105 110 

Ala Glu Cys Ser Pro Trp Cys Lys Thr Gly Gly Leu Gly Asp Val Ala 
115 120 125 

Gly Ala Leu Pro Lys Ala Leu Ala Lys Arg Gly His Arg Val Met Val 
130 135 140 

Val Val Pro Arg Tyr Gly Asp Tyr Glu Glu Pro Thr Asp Val Gly Val 
145 150 155 160 

Arg Lys Tyr Tyr Lys Ala Ala Gly Gin Asp Met Glu Val Asn Tyr Phe 
165 170 175 

His Ala Tyr He Asp Gly Val Asp Phe Val Phe He Asp Ala Pro Leu 

180 185 190 

Phe Arg His Arg Glu Glu Asp He Tyr Gly Gly Ser Arg Gin Glu He 
195 200 205 

Met Lys Arg Met He Leu Phe Cys Lys Ala Ala Val Glu Val Pro Trp 
210 215 220 

His Val Pro Cys Gly Gly Val Pro Tyr Gly Asp Gly Asn Leu Val Phe 
225 230 235 240 

He Ala Asn Asp Trp His Thr Ala Leu Leu Pro Val Tyr Leu Lys Ala 
245 250 255 

Tyr Tyr Arg Asp His Gly Leu Met Gin Tyr Thr Arg Ser He Met Val 
260 265 270 

He His Asn He Ala His Gin Gly Arg Gly Pro Val Asp Glu Phe Pro 
275 280 285 

Phe Thr Glu Leu Pro Glu His Tyr Leu Glu His Phe Arg Leu Tyr Asp 

290 295 300 

Pro Val Gly Gly Glu His Ala Asn Tyr Phe Ala Ala Gly Leu Lys Met 
305 310 315 320 

Ala Asp Gin Val Val Val Val Ser Pro Gly Tyr Leu Trp Glu Leu Lys 
325 330 335 

Thr Val Glu Gly Gly Trp Gly Leu His Asp He He Arg Gin Asn Asp 
340 345 350 

Trp Lys Thr Arg Gly He Val Asn Gly He Asp Asn Met Glu Trp Asn 
355 360 365 

Pro Glu Val Asp Ala His Leu Lys Ser Asp Gly Tyr Thr Asn Phe Ser 
370 375 380 
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Leu Arg Thr Leu Asp Ser Gly Lys Arg Gin Cys Lys Glu Ala Leu Gin 
385 390 395 400 

Arg Glu Leu Gly Leu Gin Val Arg Ala Asp Val Pro Leu Leu Gly Phe 
405 410 415 

He Gly Arg Leu Asp Gly Gin Lys Gly Val Glu He He Ala Asp Ala 
420 425 430 

Met Pro Trp He Val Ser Gin Asp Val Gin Leu Val Met Leu Gly Thr 
435 440 445 

Gly Arg His Asp Leu Glu Ser Met Leu Gin His Phe Glu Arg Glu His 
450 455 460 

His Asp Lys Val Arg Gly Trp Val Gly Phe Ser Val Arg Leu Ala His 
465 470 475 480 

Arg He Thr Ala Gly Ala Asp Ala Leu Leu Met Pro Ser Arg Phe Val 
485 490 495 

Pro Cys Gly Leu Asn Gin Leu Tyr Ala Met Ala Tyr Gly Thr Val Pro 
500 505 510 

Val Val His Ala Val Gly Gly Leu Arg Asp Thr Val Pro Pro Phe Asp 
515 520 525 

Pro Phe Asn His Ser Gly Leu Gly Trp Thr Phe Asp Arg Ala Glu Ala 

530 535 540 

His Lys Leu He Glu Ala Leu Gly His Cys Leu Arg Thr Tyr Arg Asp 
545 550 555 560 

Phe Lys Glu Ser Trp Arg Ala Leu Gin Glu Arg Gly Met Ser Gin Asp 
565 570 575 

Phe Ser Trp Glu His Ala Ala Lys Leu Tyr Glu Asp Val Leu Val Lys 
580 585 590 

Ala Lys Tyr Gin Trp 
595 



<210> 7 
<211> 5352 
<212> DNA 

<213> Triticum aestivum 

<220> 
<221> CDS 

<222> (29) . . (4921) 
<400> 7 

cggcacgagg tttagtaggt tccgggaa atg gag atg tct etc tgg cca egg 52 

Met Glu Met Ser Leu Trp Pro Arg 
1 5 

age ccc ctg tgc cct egg age agg cag eeg etc gtc gtc gtc egg ceg lOO 
Ser Pro Leu Cys Pro Arg Ser Arg Gin Pro Leu Val Val Val Arg Pro 
10 15 20 

gee ggc cgc ggc ggc etc acg cag cct ttt ttg atg aat ggc aga ttt 148 
Ala Gly Arg Gly Gly Leu Thr Gin Pro Phe Leu Met Asn Gly Arg Phe 
25 30 35 40 
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act cga age agg acc ctt cga tgc atg gta gca agt tea gat cct cct 196 
Thr Arg Ser Arg Thr Leu Arg Cys Met Val Ala Ser Ser Asp Pro Pro 
45 50 55 

aat agg aaa tea aga agg atg gta cca cct cag gtt aaa gtc att tct 
Asn Arg Lys Ser Arg Arg Met Val Pro Pro Gin Val Lys Val lie Ser 
60 65 70 

tct aga gga tat acg aca aga etc att gtt gaa cca age aac gag aat 
Ser Arg Gly Tyr Thr Thr Arg Leu He Val Glu Pro Ser Asn Glu Asn 
75 80 85 

aca gaa cac aat aat egg gat gaa gaa act ctt gat aca tac aat gcg 
Thr Glu His Asn Asn Arg Asp Glu Glu Thr Leu Asp Thr Tyr Asn Ala 
90 95 100 

eta tta agt ace gag aca gca gaa tgg aca gat aat aga gaa gee gag 
Leu Leu Ser Thr Glu Thr Ala Glu Trp Thr Asp Asn Arg Glu Ala Glu 
105 110 115 120 

act get aaa gcg gac teg teg caa aat get tta age agt tct ata att 436 
Thr Ala Lys Ala Asp Ser Ser Gin Asn Ala Leu Ser Ser Ser He He 
125 130 135 

SSS 9aa gtg gat gtg gcg gat gaa gat ata ctt gcg get gat etg aca 484 
Gly Glu Val Asp Val Ala Asp Glu Asp He Leu Ala Ala Asp Leu Thr 
140 145 150 

gtg tat tea ttg age agt gta atg aag aag gaa gtg gat gca gcg gac 532 
Val Tyr Ser Leu Ser Ser Val Met Lys Lys Glu Val Asp Ala Ala Asp 
155 160 165 



aaa get aga gtt aaa gaa gac gca ttt gag etg gat ttn gee age act 
Lys Ala Arg Val Lys Glu Asp Ala Phe Glu Leu Asp Xaa Ala Ser Thr 
170 175 180 

aca ttg aga agt gtg ata gta gat gtg atg gat cat aan tgg gac tgt 
Thr Leu Arg Ser Val He Val Asp Val Met Asp His Xaa Trp Asp Cys 
185 190 195 200 

caa gag aca ttg aga agt gtg ata gta gat gtg atg gat eat aat ggg 
Gin Glu Thr Leu Arg Ser Val He Val Asp Val Met Asp His Asn Gly 
205 210 215 



tea gga aat att tea age agt gcg acg ace gtg gaa eta gat gcg gtt 
Ser Gly Asn He Ser Ser Ser Ala Thr Thr Val Glu Leu Asp Ala Val 
250 255 260 



aat gtt tea aac agt gca acg gta egg gaa gtg gat gca agt gat gaa 
Asn Val Ser Asn Ser Ala Thr Val Arg Glu Val Asp Ala Ser Asp Glu 
285 290 295 



244 



292 



340 



388 



580 



628 



676 



act gta caa gag aca ttg aga agt gtg ata gta gat gtg atg gat gat 724 
Thr Val Gin Glu Thr Leu Arg Ser Val He Val Asp Val Met Asp Asp 
220 225 230 

gcg gcg gac aaa get aga gtt gaa gaa gac gta ttt gag etg gat ttg 772 
Ala Ala Asp Lys Ala Arg Val Glu Glu Asp Val Phe Glu Leu Asp Leu 
235 240 245 



820 



gac gaa gtc ggg cct gtt caa gac aaa ttt gag gcg acc tea tea gga 868 
Asp Glu Val Gly Pro Val Gin Asp Lys Phe Glu Ala Thr Ser Ser Gly 
265 270 275 280 



916 
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gct ggg aat gat caa ggc ata ttt aga gca gat ttg tea gga aat gtt 964 
Ala Gly Asn Asp Gin Gly lie Phe Arg Ala Asp Leu Ser Gly Asn Val 
300 305 310 

ttt tea age agt aca aca gtg gaa gtg ggt gca gtg gat gaa get ggg 1012 
Phe Ser Ser Ser Thr Thr Val Glu Val Gly Ala Val Asp Glu Ala Gly 
315 320 325 

tct ata aag gae agg ttt gag acg gat teg tea gga aat gtt tea aca 1060 
Ser lie Lys Asp Arg Phe Glu Thr Asp Ser Ser Gly Asn Val Ser Thr 
330 335 340 

agt gcg ccg atg tgg gat gca att gat gaa ace gtg get gat caa gae 1108 
Ser Ala Pro Met Trp Asp Ala lie Asp Glu Thr Val Ala Asp Gin Asp 
345 350 355 360 

aca ttt gag gcg gat ttg teg gga aat get tea age tge gca aca tac 1156 
Thr Phe Glu Ala Asp Leu Ser Gly Asn Ala Ser Ser Cys Ala Thr Tyr 
365 370 375 

aga gaa gtg gat gat gtg gtg gat gaa act aga tea gaa gag gaa aca 1204 
Arg Glu Val Asp Asp Val Val Asp Glu Thr Arg Ser Glu Glu Glu Thr 
380 385 390 

ttt gca atg gat ttg ttt gca agt gaa tea ggc cat gag aaa eat atg 1252 
Phe Ala Met Asp Leu Phe Ala Ser Glu Ser Gly His Glu Lys His Met 
395 400 405 

gca gtg gat tat gtg ggt gaa get aee gat gaa gaa gag act tac eaa 1300 
Ala Val Asp Tyr Val Gly Glu Ala Thr Asp Glu Glu Glu Thr Tyr Gin 
410 415 420 

cag caa tat cca gta ccg tct tea tte tct atg tgg gae aag get att 1348 
Gin Gin Tyr Pro Val Pro Ser Ser Phe Ser Met Trp Asp Lys Ala lie 
425 430 435 440 

get aaa aca ggt gta agt ttg aat cet gag ctg ega ett gte agg gtt 1396 
Ala Lys Thr Gly Val Ser Leu Asn Pro Glu Leu Arg Leu Val Arg Val 
445 450 455 

gaa gaa caa ggc aaa gta aat ttt agt gat aaa aaa gae ctg tea att 1444 
Glu Glu Gin Gly Lys Val Asn Phe Ser Asp Lys Lys Asp Leu Ser lie 
460 465 470 

gat gat tta cca gga eaa aae caa teg ate att ggt tec tat aaa eaa 1492 
Asp Asp Leu Pro Gly Gin Asn Gin Ser lie lie Gly Ser Tyr Lys Gin 
475 480 485 

gat aaa tea att get gat gtt gcg gga ccg ace eaa tea att ttt ggt 1540 
Asp Lys Ser He Ala Asp Val Ala Gly Pro Thr Gin Ser He Phe Gly 
490 495 500 

tct agt aaa caa cac egg tea att gtt get ttc ccc aaa caa aac cag 1588 
Ser Ser Lys Gin His Arg Ser He Val Ala Phe Pro Lys Gin Asn Gin 
505 510 515 520 

tea att gtt agt gtc act gag caa aag cag tec ata gtt gga ttc cgt 1636 
Ser He Val Ser Val Thr Glu Gin Lys Gin Ser He Val Gly Phe Arg 
525 530 535 

agt caa gat ctt teg get gtt agt etc cet aaa caa aac gta cca att 1684 
Ser Gin Asp Leu Ser Ala Val Ser Leu Pro Lys Gin Asn Val Pro He 
540 545 550 

gtt ggg tac gtc gag aga ggg tea aae naa aag caa gtt cet gtt gtt 1732 
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Val Gly Tyr Val Glu Arg Gly Ser Asn Xaa Lys Gin Val Pro Val Val 
555 560 565 

gat aga cag gat gca ttg tat gtg aat gga ctg gaa get aag gag gga 1780 
Asp Arg Gin Asp Ala Leu Tyr Val Asn Gly Leu Glu Ala Lys Glu Gly 
570 575 580 

gat cac aca tec gag aaa act gat gag gat gcg ctt cat gta aag ttt 1828 
Asp His Thr Ser Glu Lys Thr Asp Glu Asp Ala Leu His Val Lys Phe 
585 590 595 600 

aat gtt gac aat gtg ttg egg aag cat cag gca gat aga acc caa gca 1876 
Asn Val Asp Asn Val Leu Arg Lys His Gin Ala Asp Arg Thr Gin Ala 
605 610 615 

gtg gaa aag aaa act tgg aag aaa gtt gat gag gaa cat ctt tac atg 1924 
Val Glu Lys Lys Thr Trp Lys Lys Val Asp Glu Glu His Leu Tyr Met 
620 625 630 

act gaa cat cag aaa cgt get gcc gaa gga cag atg gta gtt aac gag 1972 
Thr Glu His Gin Lys Arg Ala Ala Glu Gly Gin Met Val Val Asn Glu 
635 640 645 

gat gag ctt tct ata act gaa att gga atg ggg aga ggt gat aaa att 2 02 0 
Asp Glu Leu Ser lie Thr Glu lie Gly Met Gly Arg Gly Asp Lys lie 
650 655 660 

cag cat gtg ctt tct gag gaa gag ctt tea tgg tct gaa gat gaa gtg 2068 
Gin His Val Leu Ser Glu Glu Glu Leu Ser Trp Ser Glu Asp Glu Val 
665 670 675 680 

cag tta att gag gat gat gga caa tat gaa gtt gac gag acc tct gtg 2116 
Gin Leu lie Glu Asp Asp Gly Gin Tyr Glu Val Asp Glu Thr Ser Val 
685 690 695 

tec gtt aac gtt gaa caa gat ate cag ggg tea cca cag gat gtt gtg 2164 
Ser Val Asn Val Glu Gin Asp lie Gin Gly Ser Pro Gin Asp Val Val 
700 705 710 

gat ccg caa gca eta aag gtg atg ctg caa gaa etc get gag aaa aat 2212 
Asp Pro Gin Ala Leu Lys Val Met Leu Gin Glu Leu Ala Glu Lys Asn 
715 720 725 

tat teg atg agg aac aag ctg ttt gtt ttt cca gag gta gtg aaa get 2260 
Tyr Ser Met Arg Asn Lys Leu Phe Val Phe Pro Glu Val Val Lys Ala 
730 735 740 

gat tea gtt att gat ctt tat tta aat cgt gac eta aca get ttg gcg 2308 
Asp Ser Val lie Asp Leu Tyr Leu Asn Arg Asp Leu Thr Ala Leu Ala 
745 750 755 760 

aat gaa ecc gat gtc gtc ate aaa gga gca ttc aat ggt tgg aaa tgg 23 56 
Asn Glu Pro Asp Val Val lie Lys Gly Ala Phe Asn Gly Trp Lys Trp 
765 770 775 

agg ctt ttc act gaa aga ttg cac aag agt gac ctt gga ggg gtt tgg 2404 
Arg Leu Phe Thr Glu Arg Leu His Lys Ser Asp Leu Gly Gly Val Trp 
780 785 790 

tgg tct tgc aaa ctg tac ata cec aag gag gcc tac aga tta gac ttt 2452 
Trp Ser Cys Lys Leu Tyr lie Pro Lys Glu Ala Tyr Arg Leu Asp Phe 
795 800 805 

gtg ttc ttc aac ggt egc acg gtc tat gag aac aat ggc aac aat gat 2500 
Val Phe Phe Asn Gly Arg Thr Val Tyr Glu Asn Asn Gly Asn Asn Asp 
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810 815 820 

ttc tgt ata gga ata gaa ggc act atg aat gaa gat ctg ttt gag gat 2548 
Phe Cys lie Gly lie Glu Gly Thr Met Asn Glu Asp Leu Phe Glu Asp 
825 830 835 840 

ttc ttg gtt aaa gaa aag caa agg gag ctt gag aaa ctt gcc atg gaa 2596 
Phe Leu Val Lys Glu Lys Gin Arg Glu Leu Glu Lys Leu Ala Met Glu 
845 850 855 

gaa get gaa agg agg aca cag act gaa gaa cag egg cga aga aag gaa 2644 
Glu Ala Glu Arg Arg Thr Gin Thr Glu Glu Gin Arg Arg Arg Lys Glu 
860 865 870 

gca agg get gca gat gaa get gte agg gea eaa geg aag gee gag ata 2692 
Ala Arg Ala Ala Asp Glu Ala Val Arg Ala Gin Ala Lys Ala Glu lie 
875 880 885 

gag ate aag aag aaa aaa ttg caa agt atg ttg agt ttg gcc aga aca 2740 
Glu lie Lys Lys Lys Lys Leu Gin Ser Met Leu Ser Leu Ala Arg Thr 
890 895 900 

tgt gtt gat aat ttg tgg tac ata gag get age aca gat aca aga gga 2788 
Cys Val Asp Asn Leu Tip Tyr lie Glu Ala Ser Thr Asp Thr Arg Gly 
905 910 915 920 

gat act ate agg tta tat tat aac aga aac teg agg cea ctt gcg cat 2836 
Asp Thr lie Arg Leu Tyr Tyr Asn Arg Asn Ser Arg Pro Leu Ala His 
925 930 935 

agt act gag att tgg atg cat ggt ggt tae aac aat tgg aca gat gga 28 84 
Ser Thr Glu lie Trp Met His Gly Gly Tyr Asn Asn Trp Thr Asp Gly 
940 945 950 

etc tet att gtt gaa age ttt gte aag tge aat gae aaa gae ggc gat 2932 
Leu Ser lie Val Glu Ser Phe Val Lys Cys Asn Asp Lys Asp Gly Asp 
955 960 965 

tgg tgg tat gca gat gtt att cea cet gaa aag gea ctt gtg ttg gae 2980 
Trp Trp Tyr Ala Asp Val lie Pro Pro Glu Lys Ala Leu Val Leu Asp 
970 975 980 

tgg gtt ttt get gat ggg cea get ggg aat gea agg aac tat gae aac 3028 
Trp Val Phe Ala Asp Gly Pro Ala Gly Asn Ala Arg Asn Tyr Asp Asn 
985 990 995 1000 

aat get cga caa gat ttc cat get att ctt ccg aac aac aat gta acc 3 076 
Asn Ala Arg Gin Asp Phe His Ala lie Leu Pro Asn Asn Asn Val Thr 
1005 1010 1015 

gag gaa ggc ttc tgg geg caa gag gag caa aac ate tat aca agg ctt 3124 
Glu Glu Gly Phe Trp Ala Gin Glu Glu Gin Asn lie Tyr Thr Arg Leu 
1020 1025 1030 

ctg caa gaa agg aga gaa aag gaa gaa acc atg aaa aga aag get gag 3172 
Leu Gin Glu Arg Arg Glu Lys Glu Glu Thr Met Lys Arg Lys Ala Glu 
1035 1040 1045 

aga agt gca aat ate aaa get gag atg aag gca aaa act atg cga agg 3220 
Arg Ser Ala Asn lie Lys Ala Glu Met Lys Ala Lys Thr Met Arg Arg 
1050 1055 1060 

ttt ctg ctt tee cag aaa cac att gtt tat acc cga ace gne ttg aaa 3268 
Phe Leu Leu Ser Gin Lys His lie Val Tyr Thr Arg Thr Xaa Leu Lys 
1065 1070 1075 1080 
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tac gtg ccc gga acc aca gtg gat gtg eta tac aat ccc tct aac aca 3316 
Tyr Val Pro Gly Thr Thr Val Asp Val Leu Tyr Asn Pro Ser Asn Thr 
1085 1090 1095 

gtg eta aat gga aag teg gag ggt tgg ttt aga tge tec ttt aae ctt 3364 
Val Leu Asn Gly Lys Ser Glu Gly Trp Phe Arg Cys Ser Phe Asn Leu 
1100 1105 1110 

tgg atg cat tea agt ggg gca ttg cca ccc cag aag atg gtg aaa tea 3412 
Trp Met His Ser Ser Gly Ala Leu Pro Pro Gin Lys Met Val Lys Ser 
1115 1120 1125 

ggg gat ggg ccg etc tta aaa gca aca gtt gat gtt cca ccg gat gee 3460 
Gly Asp Gly Pro Leu Leu Lys Ala Thr Val Asp Val Pro Pro Asp Ala 
1130 1135 1140 

tat atg atg gac ttt gtt ttc tec gag tgg gaa gaa gat ggg ate tat 3 508 
Tyr Met Met Asp Phe Val Phe Ser Glu Trp Glu Glu Asp Gly lie Tyr 
1145 1150 1155 1160 

gac aac agg aat ggg atg gae tat cat att cet gtt tct gat tea att 3556 
Asp Asn Arg Asn Gly Met Asp Tyr His lie Pro Val Ser Asp Ser lie 
1165 1170 1175 

gaa aca gag aat tac atg cgt att ate cac att gee gtt gag atg gee 3604 
Glu Thr Glu Asn Tyr Met Arg lie lie His lie Ala Val Glu Met Ala 
1180 1185 1190 

ccc gtt gca aag gtt gga ggt ctt ggg gat gtt gtt aca agt ctt tea 3 652 
Pro Val Ala Lys Val Gly Gly Leu Gly Asp Val Val Thr Ser Leu Ser 
1195 1200 1205 

cgt gee att caa gat eta gga eat act gte gag gtt att etc eeg aag 3 700 
Arg Ala lie Gin Asp Leu Gly His Thr Val Glu Val lie Leu Pro Lys 
1210 1215 1220 

tae gac tgt ttg aae caa age agt gte aag gat tta cat tta tat caa 3 748 
Tyr Asp Cys Leu Asn Gin Ser Ser Val Lys Asp Leu His Leu Tyr Gin 
1225 1230 1235 1240 

agt ttt tct tgg ggt ggt aca gaa ata aaa gta tgg gtt gga cga gte 3 796 
Ser Phe Ser Trp Gly Gly Thr Glu lie Lys Val Trp Val Gly Arg Val 
1245 1250 1255 

gaa gae ctg ace gtt tac ttc ctg gaa cct caa aat ggg atg ttt ggc 3 844 
Glu Asp Leu Thr Val Tyr Phe Leu Glu Pro Gin Asn Gly Met Phe Gly 
1260 1265 1270 

gtt gga tgt gta tat gga agg aat gat gac egc aga ttt ggg ttc ttc 3 892 
Val Gly Cys Val Tyr Gly Arg Asn Asp Asp Arg Arg Phe Gly Phe Phe 
1275 1280 1285 

tgt cat tct get eta gag ttt ate etc cag aat gaa ttt tct cca cat 3 940 
Cys His Ser Ala Leu Glu Phe lie Leu Gin Asn Glu Phe Ser Pro His 
1290 1295 1300 

ata ata cat tgc eat gat tgg tea agt get ccg gte gee tgg eta tat 3 988 
lie lie His Cys His Asp Trp Ser Ser Ala Pro Val Ala Trp Leu Tyr 
1305 1310 1315 1320 

aag gaa eae tat tec caa tec aga atg gca age act egg gtt gta ttt 4036 
Lys Glu His Tyr Ser Gin Ser Arg Met Ala Ser Thr Arg Val Val Phe 
1325 1330 1335 
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acc ate cac aat ctt gaa ttt gga gca cat tat att ggt aaa gca atg 4084 
Thr lie His Asn Leu Glu Phe Gly Ala His Tyr lie Gly Lys Ala Met 
1340 1345 1350 

aca tac tgt gat aaa gcc aca act gtt tct cct aca tat tea agg gac 4132 
Thr Tyr Cys Asp Lys Ala Thr Thr Val Ser Pro Thr Tyr Ser Arg Asp 
1355 1360 1365 

gtg gca ggc cat ggc gcc att get cct eat egt gag aaa ttc tac ggc 4180 
Val Ala Gly His Gly Ala He Ala Pro His Arg Glu Lys Phe Tyr Gly 
1370 1375 1380 

att etc aat gga att gat cca gat ate tgg gat eeg tac act gac aat 4228 
He Leu Asn Gly He Asp Pro Asp lie Trp Asp Pro Tyr Thr Asp Asn 
1385 1390 1395 1400 

ttt ate eeg gtc cct tat act tgt gag aat gtt gte gaa ggc aag aga 4276 
Phe He Pro Val Pro Tyr Thr Cys Glu Asn Val Val Glu Gly Lys Arg 
1405 1410 1415 

get gca aaa agg gcc ttg cag cag aag ttt gga tta cag caa act gat 4324 
Ala Ala Lys Arg Ala Leu Gin Gin Lys Phe Gly Leu Gin Gin Thr Asp 
1420 1425 1430 

gtc cct att gtc gga ate ate ace egt ctg aca gee cag aag gga ate 4372 
Val Pro He Val Gly He He Thr Arg Leu Thr Ala Gin Lys Gly He 
1435 1440 1445 

cac etc ate aag cac gca att cac cga act etc gaa age aac gga cat 4420 
His Leu He Lys His Ala He His Arg Thr Leu Glu Ser Asn Gly His 
1450 1455 1460 

gtg gtt ttg ctt ggt tea get cca gat cat cga ata caa ggc gat ttt 4468 
Val Val Leu Leu Gly Ser Ala Pro Asp His Arg He Gin Gly Asp Phe 
1465 1470 1475 1480 

tgc aga ttg gee gat get ctt cat ggt gtt tac cat ggt agg gtg aag 4516 
Cys Arg Leu Ala Asp Ala Leu His Gly Val Tyr His Gly Arg Val Lys 
1485 1490 1495 

ctt gtt eta ace tat gat gag cct ctt tct cac ctg ata tac get ggc 4564 
Leu Val Leu Thr Tyr Asp Glu Pro Leu Ser His Leu He Tyr Ala Gly 
1500 1505 1510 

teg gac ttc ata att gtt cct tea ate ttc gaa cec tgt ggc tta aca 4612 
Ser Asp Phe He He Val Pro Ser He Phe Glu Pro Cys Gly Leu Thr 
1515 1520 1525 

caa ctt gtt gcc atg egt tat gga teg ate cct ata gtt egg aaa act 4660 
Gin Leu Val Ala Met Arg Tyr Gly Ser He Pro He Val Arg Lys Thr 
1530 1535 1540 

gga gga ctt cac gac aca gte ttc gac gta gac aat gat aag gac egg 4708 
Gly Gly Leu His Asp Thr Val Phe Asp Val Asp Asn Asp Lys Asp Arg 
1545 1550 1555 1560 

get egg tct ctt ggt ctt gaa cca aat ggg ttc agt ttc gac gga gee 4756 
Ala Arg Ser Leu Gly Leu Glu Pro Asn Gly Phe Ser Phe Asp Gly Ala 
1565 1570 1575 

gac age aat ggc gtg gat tat gee etc aac aga gca ate ggc get tgg 4804 
Asp Ser Asn Gly Val Asp Tyr Ala Leu Asn Arg Ala He Gly Ala Trp 
1580 1585 1590 

ttc gat gcc egt gat tgg ttc cac tec ctg tgt aag agg gte atg gag 4852 
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Phe Asp Ala Arg Asp Trp Phe His Ser Leu Cys Lys Arg Val Met Glu 
1595 1600 1605 

caa gac tgg teg tgg aac egg ccc gca ctg gac tac att gaa ttg tac 4900 
Gin Asp Trp Ser Trp Asn Arg Pro Ala Leu Asp Tyr lie Glu Leu Tyr 
1610 1615 1620 

cat gcc get cga aaa ttc tga cacccaactg aaccaatgac aagaacaagc 4951 
His Ala Ala Arg Lys Phe 
1625 1630 

gcattgtggg atcgactagt catacagggc tgtgcagatc gtcttgette agttagtgee 5011 

ctcttcagtt agtteeaagc gcaetacagt cgtacatagc tgaggatcct ettgeetect 5071 

aceaggggga acaaageaga aatgcatgag tgeattggga agaettttat gtatattgtt 5131 

aaaaaaattt ecttttcttt tcettcectg eaectggaaa tggttaagcg eategecgag 5191 

ataagaaccg cagtgacatt etgtgagtag ctttgtatat tctctcatct tgtgaaaact 5251 

aatgttcatg ttaggctgtc tgatcatgtg gaagctttgt tatatgttac ttatggtata 5311 

catcaatgat atttacattt gtggaaaaaa aaaaaaaaaa a 53 52 

<210> 8 
<211> 1630 
<212> PRT 

<213> Triticum aestivxim 

<400> 8 

Met Glu Met Ser Leu Trp Pro Arg Ser Pro Leu Cys Pro Arg Ser Arg 
15 10 15 

Gin Pro Leu Val Val Val Arg Pro Ala Gly Arg Gly Gly Leu Thr Gin 
20 25 30 

Pro Phe Leu Met Asn Gly Arg Phe Thr Arg Ser Arg Thr Leu Arg Cys 
35 40 45 

Met Val Ala Ser Ser Asp Pro Pro Asn Arg Lys Ser Arg Arg Met Val 
50 55 60 

Pro Pro Gin Val Lys Val He Ser Ser Arg Gly Tyr Thr Thr Arg Leu 
65 70 75 80 

He Val Glu Pro Ser Asn Glu Asn Thr Glu His Asn Asn Arg Asp Glu 
85 90 95 

Glu Thr Leu Asp Thr Tyr Asn Ala Leu Leu Ser Thr Glu Thr Ala Glu 
100 105 110 

Trp Thr Asp Asn Arg Glu Ala Glu Thr Ala Lys Ala Asp Ser Ser Gin 
115 120 125 

Asn Ala Leu Ser Ser Ser He lie Gly Glu Val Asp Val Ala Asp Glu 
130 135 140 

Asp He Leu Ala Ala Asp Leu Thr Val Tyr Ser Leu Ser Ser Val Met 
145 150 155 160 

Lys Lys Glu Val Asp Ala Ala Asp Lys Ala Arg Val Lys Glu Asp Ala 
165 170 175 
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Phe Glu Leu Asp Xaa Ala Ser Thr Thr Leu Arg Ser Val lie Val Asp 
180 185 190 

Val Met Asp His Xaa Trp Asp Cys Gin Glu Thr Leu Arg Ser Val lie 

195 200 205 

Val Asp Val Met Asp His Asn Gly Thr Val Gin Glu Thr Leu Arg Ser 
210 215 220 

Val lie Val Asp Val Met Asp Asp Ala Ala Asp Lys Ala Arg Val Glu 
225 230 235 240 

Glu Asp Val Phe Glu Leu Asp Leu Ser Gly Asn lie Ser Ser Ser Ala 
245 250 255 

Thr Thr Val Glu Leu Asp Ala Val Asp Glu Val Gly Pro Val Gin Asp 
260 265 270 

Lys Phe Glu Ala Thr Ser Ser Gly Asn Val Ser Asn Ser Ala Thr Val 

275 280 285 

Arg Glu Val Asp Ala Ser Asp Glu Ala Gly Asn Asp Gin Gly lie Phe 
290 295 300 

Arg Ala Asp Leu Ser Gly Asn Val Phe Ser Ser Ser Thr Thr Val Glu 
305 310 315 320 

Val Gly Ala Val Asp Glu Ala Gly Ser He Lys Asp Arg Phe Glu Thr 
325 330 335 

Asp Ser Ser Gly Asn Val Ser Thr Ser Ala Pro Met Trp Asp Ala He 
340 345 350 

Asp Glu Thr Val Ala Asp Gin Asp Thr Phe Glu Ala Asp Leu Ser Gly 

355 360 365 

Asn Ala Ser Ser Cys Ala Thr Tyr Arg Glu Val Asp Asp Val Val Asp 
370 375 380 

Glu Thr Arg Ser Glu Glu Glu Thr Phe Ala Met Asp Leu Phe Ala Ser 
385 390 395 400 

Glu Ser Gly His Glu Lys His Met Ala Val Asp Tyr Val Gly Glu Ala 
405 410 415 

Thr Asp Glu Glu Glu Thr Tyr Gin Gin Gin Tyr Pro Val Pro Ser Ser 
420 425 430 

Phe Ser Met Trp Asp Lys Ala He Ala Lys Thr Gly Val Ser Leu Asn 
435 440 445 

Pro Glu Leu Arg Leu Val Arg Val Glu Glu Gin Gly Lys Val Asn Phe 
450 455 460 

Ser Asp Lys Lys Asp Leu Ser He Asp Asp Leu Pro Gly Gin Asn Gin 

465 470 475 480 

Ser He He Gly Ser Tyr Lys Gin Asp Lys Ser He Ala Asp Val Ala 
485 490 495 

Gly Pro Thr Gin Ser He Phe Gly Ser Ser Lys Gin His Arg Ser He 
500 505 510 

Val Ala Phe Pro Lys Gin Asn Gin Ser He Val Ser Val Thr Glu Gin 
515 520 525 
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Lys Gin Ser lie Val Gly Phe Arg Ser Gin Asp Leu Ser Ala Val Ser 
530 535 540 

Leu Pro Lys Gin Asn Val Pro lie Val Gly Tyr Val Glu Arg Gly Ser 
545 550 555 560 

Asn Xaa Lys Gin Val Pro Val Val Asp Arg Gin Asp Ala Leu Tyr Val 

565 570 575 

Asn Gly Leu Glu Ala Lys Glu Gly Asp His Thr Ser Glu Lys Thr Asp 
580 585 590 

Glu Asp Ala Leu His Val Lys Phe Asn Val Asp Asn Val Leu Arg Lys 
595 600 605 

His Gin Ala Asp Arg Thr Gin Ala Val Glu Lys Lys Thr Trp Lys Lys 
610 615 620 

Val Asp Glu Glu His Leu Tyr Met Thr Glu His Gin Lys Arg Ala Ala 
625 630 635 640 

Glu Gly Gin Met Val Val Asn Glu Asp Glu Leu Ser lie Thr Glu lie 

645 650 655 

Gly Met Gly Arg Gly Asp Lys lie Gin His Val Leu Ser Glu Glu Glu 
660 665 670 

Leu Ser Trp Ser Glu Asp Glu Val Gin Leu lie Glu Asp Asp Gly Gin 
675 680 685 

Tyr Glu Val Asp Glu Thr Ser Val Ser Val Asn Val Glu Gin Asp lie 
690 695 700 

Gin Gly Ser Pro Gin Asp Val Val Asp Pro Gin Ala Leu Lys Val Met 
705 710 715 720 

Leu Gin Glu Leu Ala Glu Lys Asn Tyr Ser Met Arg Asn Lys Leu Phe 

725 730 735 

Val Phe Pro Glu Val Val Lys Ala Asp Ser Val lie Asp Leu Tyr Leu 
740 745 750 

Asn Arg Asp Leu Thr Ala Leu Ala Asn Glu Pro Asp Val Val lie Lys 
755 760 765 

Gly Ala Phe Asn Gly Trp Lys Trp Arg Leu Phe Thr Glu Arg Leu His 
770 775 780 

Lys Ser Asp Leu Gly Gly Val Trp Trp Ser Cys Lys Leu Tyr lie Pro 
785 790 795 800 

Lys Glu Ala Tyr Arg Leu Asp Phe Val Phe Phe Asn Gly Arg Thr Val 
805 810 815 

Tyr Glu Asn Asn Gly Asn Asn Asp Phe Cys He Gly He Glu Gly Thr 
820 825 830 

Met Asn Glu Asp Leu Phe Glu Asp Phe Leu Val Lys Glu Lys Gin Arg 
835 840 845 

Glu Leu Glu Lys Leu Ala Met Glu Glu Ala Glu Arg Arg Thr Gin Thr 
850 855 860 

Glu Glu Gin Arg Arg Arg Lys Glu Ala Arg Ala Ala Asp Glu Ala Val 
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865 870 875 880 

Arg Ala Gin Ala Lys Ala Glu lie Glu lie Lys Lys Lys Lys Leu Gin 
885 890 895 

Ser Met Leu Ser Leu Ala Arg Thr Cys Val Asp Asn Leu Trp Tyr lie 

900 905 910 

Glu Ala Ser Thr Asp Thr Arg Gly Asp Thr lie Arg Leu Tyr Tyr Asn 
915 920 925 

Arg Asn Ser Arg Pro Leu Ala His Ser Thr Glu lie Trp Met His Gly 
930 935 940 

Gly Tyr Asn Asn Trp Thr Asp Gly Leu Ser lie Val Glu Ser Phe Val 
945 950 955 960 

Lys Cys Asn Asp Lys Asp Gly Asp Trp Trp Tyr Ala Asp Val lie Pro 
965 970 975 

Pro Glu Lys Ala Leu Val Leu Asp Trp Val Phe Ala Asp Gly Pro Ala 
980 985 990 

Gly Asn Ala Arg Asn Tyr Asp Asn Asn Ala Arg Gin Asp Phe His Ala 
995 1000 1005 

lie Leu Pro Asn Asn Asn Val Thr Glu Glu Gly Phe Trp Ala Gin Glu 
1010 1015 1020 

Glu Gin Asn He Tyr Thr Arg Leu Leu Gin Glu Arg Arg Glu Lys Glu 
025 1030 1035 1040 

Glu Thr Met Lys Arg Lys Ala Glu Arg Ser Ala Asn He Lys Ala Glu 
1045 1050 1055 

Met Lys Ala Lys Thr Met Arg Arg Phe Leu Leu Ser Gin Lys His He 
1060 1065 1070 

Val Tyr Thr Arg Thr Xaa Leu Lys Tyr Val Pro Gly Thr Thr Val Asp 
1075 1080 1085 

Val Leu Tyr Asn Pro Ser Asn Thr Val Leu Asn Gly Lys Ser Glu Gly 
1090 1095 1100 

Trp Phe Arg Cys Ser Phe Asn Leu Trp Met His Ser Ser Gly Ala Leu 
105 1110 1115 1120 

Pro Pro Gin Lys Met Val Lys Ser Gly Asp Gly Pro Leu Leu Lys Ala 
1125 1130 1135 

Thr Val Asp Val Pro Pro Asp Ala Tyr Met Met Asp Phe Val Phe Ser 
1140 1145 1150 

Glu Trp Glu Glu Asp Gly He Tyr Asp Asn Arg Asn Gly Met Asp Tyr 
1155 1160 1165 

His He Pro Val Ser Asp Ser He Glu Thr Glu Asn Tyr Met Arg He 
1170 1175 1180 

He His He Ala Val Glu Met Ala Pro Val Ala Lys Val Gly Gly Leu 
185 1190 1195 1200 



Gly Asp Val Val Thr Ser Leu Ser Arg Ala He Gin Asp Leu Gly His 
1205 1210 1215 
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Thr Val Glu Val lie Leu Pro Lys Tyr Asp Cys Leu Asn Gin Ser Ser 
1220 1225 1230 

Val Lys Asp Leu His Leu Tyr Gin Ser Phe Ser Trp Gly Gly Thr Glu 
1235 1240 1245 

lie Lys Val Trp Val Gly Arg Val Glu Asp Leu Thr Val Tyr Phe Leu 
1250 1255 1260 

Glu Pro Gin Asn Gly Met Phe Gly Val Gly Cys Val Tyr Gly Arg Asn 
265 1270 1275 1280 

Asp Asp Arg Arg Phe Gly Phe Phe Cys His Ser Ala Leu Glu Phe lie 
1285 1290 1295 

Leu Gin Asn Glu Phe Ser Pro His lie lie His Cys His Asp Trp Ser 
1300 1305 1310 

Ser Ala Pro Val Ala Trp Leu Tyr Lys Glu His Tyr Ser Gin Ser Arg 
1315 1320 1325 

Met Ala Ser Thr Arg Val Val Phe Thr lie His Asn Leu Glu Phe Gly 
1330 1335 1340 

Ala His Tyr lie Gly Lys Ala Met Thr Tyr Cys Asp Lys Ala Thr Thr 
345 1350 1355 1360 

Val Ser Pro Thr Tyr Ser Arg Asp Val Ala Gly His Gly Ala He Ala 

1365 1370 1375 

Pro His Arg Glu Lys Phe Tyr Gly He Leu Asn Gly He Asp Pro Asp 
1380 1385 1390 

He Trp Asp Pro Tyr Thr Asp Asn Phe He Pro Val Pro Tyr Thr Cys 
1395 1400 1405 

Glu Asn Val Val Glu Gly Lys Arg Ala Ala Lys Arg Ala Leu Gin Gin 
1410 1415 1420 

Lys Phe Gly Leu Gin Gin Thr Asp Val Pro He Val Gly He He Thr 
425 1430 1435 1440 

Arg Leu Thr Ala Gin Lys Gly He His Leu He Lys His Ala He His 
1445 1450 1455 

Arg Thr Leu Glu Ser Asn Gly His Val Val Leu Leu Gly Ser Ala Pro 
1460 1465 1470 

Asp His Arg He Gin Gly Asp Phe Cys Arg Leu Ala Asp Ala Leu His 
1475 1480 1485 

Gly Val Tyr His Gly Arg Val Lys Leu Val Leu Thr Tyr Asp Glu Pro 
1490 1495 1500 

Leu Ser His Leu He Tyr Ala Gly Ser Asp Phe He He Val Pro Ser 
505 1510 1515 1520 

He Phe Glu Pro Cys Gly Leu Thr Gin Leu Val Ala Met Arg Tyr Gly 
1525 1530 1535 

Ser He Pro He Val Arg Lys Thr Gly Gly Leu His Asp Thr Val Phe 
1540 1545 1550 

Asp Val Asp Asn Asp Lys Asp Arg Ala Arg Ser Leu Gly Leu Glu Pro 
1555 1560 1565 
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Asn Gly Phe Ser Phe Asp Gly Ala 
1570 1575 

Leu Asn Arg Ala lie Gly Ala Trp 
585 1590 

Ser Leu Cys Lys Arg Val Met Glu 
1605 



Asp Ser Asn Gly Val Asp Tyr Ala 
1580 

Phe Asp Ala Arg Asp Trp Phe His 
1595 1600 

Gin Asp Trp Ser Trp Asn Arg Pro 
1610 1615 



Ala Leu Asp Tyr lie Glu Leu Tyr His Ala Ala Arg Lys Phe 
1620 1625 1630 



<210> 9 
<211> 3621 
<212> DNA 

<213> Triticum aestivum 

<220> 

<221> CDS 

<222> (1) . . (3180) 

<400> 9 

gat gca ttg tat gtg aat gga ctg gaa get aag gag gga gat cac aca 48 
Asp Ala Leu Tyr Val Asn Gly Leu Glu Ala Lys Glu Gly Asp His Thr 
15 10 15 

tec gag aaa act gat gag gat gcg ctt cat gta aag ttt aat gtt gac 96 
Ser Glu Lys Thr Asp Glu Asp Ala Leu His Val Lys Phe Asn Val Asp 
20 25 30 

aat gtg ttg egg aag cat cag gca gat aga acc caa gca gtg gaa aag 144 
Asn Val Leu Arg Lys His Gin Ala Asp Arg Thr Gin Ala Val Glu Lys 
35 40 45 

aaa act tgg aag aaa gtt gat gag gaa cat ctt tac atg act gaa cat 192 
Lys Thr Trp Lys Lys Val Asp Glu Glu His Leu Tyr Met Thr Glu His 
50 55 60 

cag aaa cgt get gcc gaa gga cag atg gta gtt aac gag gat gag ctt 24 0 
Gin Lys Arg Ala Ala Glu Gly Gin Met Val Val Asn Glu Asp Glu Leu 
65 70 75 80 

tct ata act gaa att gga atg ggg aga ggt gat aaa att cag cat gtg 2 88 
Ser lie Thr Glu He Gly Met Gly Arg Gly Asp Lys He Gin His Val 
85 90 95 

ctt tct gag gaa gag ctt tea tgg tct gaa gat gaa gtg cag tta att 336 
Leu Ser Glu Glu Glu Leu Ser Trp Ser Glu Asp Glu Val Gin Leu He 
100 105 110 

gag gat gat gga caa tat gaa gtt gac gag acc tct gtg tec gtt aac 384 
Glu Asp Asp Gly Gin Tyr Glu Val Asp Glu Thr Ser Val Ser Val Asn 
115 120 125 

gtt gaa caa gat ate cag ggg tea oca cag gat gtt gtg gat ccg caa 432 
Val Glu Gin Asp He Gin Gly Ser Pro Gin Asp Val Val Asp Pro Gin 
130 135 140 

gca eta aag gtg atg ctg caa gaa etc get gag aaa aat tat teg atg 480 
Ala Leu Lys Val Met Leu Gin Glu Leu Ala Glu Lys Asn Tyr Ser Met 
145 150 155 160 



agg aac aag ctg ttt gtt ttt cca gag gta gtg aaa get gat tea gtt 528 
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Arg Asn Lys Leu Phe Val Phe Pro Glu Val Val Lys Ala Asp Ser Val 
165 170 175 

att gat ctt tat tta aat cgt gac eta aca get ttg gcg aat gaa ccc 576 
lie Asp Leu Tyr Leu Asn Arg Asp Leu Thr Ala Leu Ala Asn Glu Pro 
180 185 190 

gat gtc gtc ate aaa gga gca tte aat ggt tgg aaa tgg agg ctt ttc 624 
Asp Val Val lie Lys Gly Ala Phe Asn Gly Trp Lys Trp Arg Leu Phe 
195 200 205 

act gaa aga ttg cac aag agt gac ctt gga ggg gtt tgg tgg tct tgc 672 
Thr Glu Arg Leu His Lys Ser Asp Leu Gly Gly Val Trp Trp Ser Cys 
210 215 220 



aaa etg tac ata cee aag gag gee tac aga tta gac ttt gtg ttc ttc 
Lys Leu Tyr lie Pro Lys Glu Ala Tyr Arg Leu Asp Phe Val Phe Phe 
225 230 235 240 



gga ata gaa ggc act atg aat gaa gat ctg ttt gag gat ttc ttg gtt 
Gly He Glu Gly Thr Met Asn Glu Asp Leu Phe Glu Asp Phe Leu Val 
260 265 270 



gca gat gaa get gtc agg gca caa gcg aag gcc gag ata gag ate aag 
Ala Asp Glu Ala Val Arg Ala Gin Ala Lys Ala Glu He Glu He Lys 
305 310 315 320 



720 



aac ggt cgc acg gtc tat gag aac aat ggc aac aat gat ttc tgt ata 768 
Asn Gly Arg Thr Val Tyr Glu Asn Asn Gly Asn Asn Asp Phe Cys He 
245 250 255 



816 



aaa gaa aag caa agg gag ctt gag aaa ctt gcc atg gaa gaa get gaa 864 
Lys Glu Lys Gin Arg Glu Leu Glu Lys Leu Ala Met Glu Glu Ala Glu 
275 280 285 

^S9 ^99 cag act gaa gaa cag egg cga aga aag gaa gca agg get 912 

Arg Arg Thr Gin Thr Glu Glu Gin Arg Arg Arg Lys Glu Ala Arg Ala 
290 295 300 



960 



aag aaa aaa ttg caa agt atg ttg agt ttg gcc aga aca tgt gtt gat 1008 
Lys Lys Lys Leu Gin Ser Met Leu Ser Leu Ala Arg Thr Cys Val Asp 
325 330 335 

aat ttg tgg tac ata gag get age aca gat aca aga gga gat act ate 1056 
Asn Leu Trp Tyr He Glu Ala Ser Thr Asp Thr Arg Gly Asp Thr He 
340 345 350 

agg tta tat tat aac aga aac teg agg cea ctt gcg cat agt act gag 1104 
Arg Leu Tyr Tyr Asn Arg Asn Ser Arg Pro Leu Ala His Ser Thr Glu 
355 360 365 

att tgg atg cat ggt ggt tac aac aat tgg tea gat gga etc tct att 1152 
He Trp Met His Gly Gly Tyr Asn Asn Trp Ser Asp Gly Leu Ser He 
370 375 380 

gtt gaa age ttt gtc aag tgc aat gac aaa gac ggc gat tgg tgg tat 1200 
Val Glu Ser Phe Val Lys Cys Asn Asp Lys Asp Gly Asp Trp Trp Tyr 
385 390 395 400 

gca gat gtt att cca cet gaa aag gca ctt gtg ttg gac tgg gtt ttt 1248 
Ala Asp Val He Pro Pro Glu Lys Ala Leu Val Leu Asp Trp Val Phe 
405 410 415 

get gat ggg cca get ggg aat gca agg aac tat gac aac aat get cga 1296 
Ala Asp Gly Pro Ala Gly Asn Ala Arg Asn Tyr Asp Asn Asn Ala Arg 
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420 425 430 

caa gat ttc cat get att ctt ccg aac aac aat gta acc gag gaa ggc 1344 
Gin Asp Phe His Ala lie Leu Pro Asn Asn Asn Val Thr Glu Glu Gly 
435 440 445 

ttc tgg gcg caa gag gag caa aac ate tat aca agg ctt ctg caa gaa 13 92 
Phe Trp Ala Gin Glu Glu Gin Asn lie Tyr Thr Arg Leu Leu Gin Glu 
450 455 460 

agg aga gaa aag gaa gaa acc atg aaa aga aag get gag aga agt gca 1440 
Arg Arg Glu Lys Glu Glu Thr Met Lys Arg Lys Ala Glu Arg Ser Ala 
465 470 475 480 

aat ate aaa get gag atg aag gca aaa act atg cga agg ttt ctg ctt 1488 
Asn He Lys Ala Glu Met Lys Ala Lys Thr Met Arg Arg Phe Leu Leu 
485 490 495 

tec cag aaa cac att gtt tat acc cga acc gnc ttg aaa tac gtg cce 1536 
Ser Gin Lys His He Val Tyr Thr Arg Thr Xaa Leu Lys Tyr Val Pro 
500 505 510 

gga acc aca gtg gat gtg eta tac aat cce tct aac aca gtg eta aat 1584 
Gly Thr Thr Val Asp Val Leu Tyr Asn Pro Ser Asn Thr Val Leu Asn 
515 520 525 

gga aag teg gag ggt tgg ttt aga tgc tec ttt aac ctt tgg atg cat 1632 
Gly Lys Ser Glu Gly Trp Phe Arg Cys Ser Phe Asn Leu Trp Met His 
530 535 540 

tea agt ggg gca ttg cca cce cag aag atg gtg aaa tea ggg gat ggg 1680 
Ser Ser Gly Ala Leu Pro Pro Gin Lys Met Val Lys Ser Gly Asp Gly 
545 550 555 560 

ccg etc tta aaa gca aca gtt gat gtt cca ccg gat gee tat atg atg 1728 
Pro Leu Leu Lys Ala Thr Val Asp Val Pro Pro Asp Ala Tyr Met Met 
565 570 575 

gac ttt gtt ttc tec gag tgg gaa gaa gat ggg ate tat gac aac agg 1776 
Asp Phe Val Phe Ser Glu Trp Glu Glu Asp Gly He Tyr Asp Asn Arg 
580 585 590 

aat ggg atg gac tat cat att cet gtt tct gat tea att gaa aca gag 1824 
Asn Gly Met Asp Tyr His He Pro Val Ser Asp Ser He Glu Thr Glu 
595 600 605 

aat tac atg cgt att ate cac att gcc gtt gag atg gee cce gtt gca 1872 
Asn Tyr Met Arg He He His He Ala Val Glu Met Ala Pro Val Ala 
610 615 620 

aag gtt gga ggt ctt ggg gat gtt gtt aca agt ctt tea cgt gcc att 1920 
Lys Val Gly Gly Leu Gly Asp Val Val Thr Ser Leu Ser Arg Ala He 
625 630 635 640 

caa gat eta gga cat act gtc gag gtt att etc ccg aag tac gac tgt 1968 
Gin Asp Leu Gly His Thr Val Glu Val He Leu Pro Lys Tyr Asp Cys 
645 650 655 

ttg aac caa age agt gtc aag gat tta cat tta tat caa agt ttt tct 2016 
Leu Asn Gin Ser Ser Val Lys Asp Leu His Leu Tyr Gin Ser Phe Ser 
660 665 670 

tgg ggt ggt aca gaa ata aaa gta tgg gtt gga cga gtc gaa gac ctg 2064 
Trp Gly Gly Thr Glu He Lys Val Trp Val Gly Arg Val Glu Asp Leu 
675 680 685 
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acc gtt tac ttc ctg gaa cct caa aat ggg atg ttt ggc gtt gga tgt 2112 
Thr Val Tyr Phe Leu Glu Pro Gin Asn Gly Met Phe Gly Val Gly Cys 
690 695 700 



gta tat gga agg aat gat gac cgc aga ttt ggg ttc ttc tgt cat tot 

Val Tyr Gly Arg Asn Asp Asp Arg Arg Phe Gly Phe Phe Cys His Ser 

705 710 715 720 

get eta gag ttt ate etc cag aat gaa ttt tct eca eat ata ata cat 

Ala Leu Glu Phe lie Leu Gin Asn Glu Phe Ser Pro His lie lie His 

725 730 735 



gat aaa gcc aca act gtt tct cct aca tat tea agg gac gtg gca ggc 
Asp Lys Ala Thr Thr Val Ser Pro Thr Tyr Ser Arg Asp Val Ala Gly 
785 790 795 800 



2160 



2208 



tgc cat gat tgg tea agt get ccg gtc gcc tgg eta tat aag gaa cac 2256 
Cys His Asp Trp Ser Ser Ala Pro Val Ala Trp Leu Tyr Lys Glu His 
740 745 750 

tat tee caa tec aga atg gca age act egg gtt gta ttt ace ate cac 2304 
Tyr Ser Gin Ser Arg Met Ala Ser Thr Arg Val Val Phe Thr lie His 
755 760 765 

aat ctt gaa ttt gga gca cat tat att ggt aaa gca atg aca tac tgt 2352 
Asn Leu Glu Phe Gly Ala His Tyr lie Gly Lys Ala Met Thr Tyr Cys 
770 775 780 



2400 



cat ggc gcc att get cct cat cgt gag aaa ttc tac ggc att etc aat 244 8 
His Gly Ala lie Ala Pro His Arg Glu Lys Phe Tyr Gly He Leu Asn 
805 810 815 

gga att gat eca gat ate tgg gat ccg tac act gac aat ttt ate ccg 2496 
Gly He Asp Pro Asp He Trp Asp Pro Tyr Thr Asp Asn Phe He Pro 
820 825 830 

gtc cct tat act tgt gag aat gtt gtc gaa ggc aag agg get gca aaa 2544 
Val Pro Tyr Thr Cys Glu Asn Val Val Glu Gly Lys Arg Ala Ala Lys 
835 840 845 

agg gee ttg cag cag aag ttt gga tta cag caa act gat gtc cct att 2592 
Arg Ala Leu Gin Gin Lys Phe Gly Leu Gin Gin Thr Asp Val Pro He 
850 855 860 

gtc gga ate ate ace cgt ctg aca gca cag aag gga ate cac etc ate 264 0 
Val Gly He He Thr Arg Leu Thr Ala Gin Lys Gly He His Leu He 
865 870 875 880 

aag cac gca att cac ega ace etc gag age aat gga caa gtg gtt ttg 2688 
Lys His Ala He His Arg Thr Leu Glu Ser Asn Gly Gin Val Val Leu 
885 890 895 

ctt ggt tea get cea gat cat ega ata caa ggc gat ttt tgc aga ttg 2736 
Leu Gly Ser Ala Pro Asp His Arg He Gin Gly Asp Phe Cys Arg Leu 
900 905 910 

gcc gat get ctt cac ggt gtt tac cat ggt agg gtg aag ctt gtt eta 2784 
Ala Asp Ala Leu His Gly Val Tyr His Gly Arg Val Lys Leu Val Leu 
915 920 925 

ace tac gat gag cct ctt tct cac ctg ata tac get ggc tec gac ttc 2832 
Thr Tyr Asp Glu Pro Leu Ser His Leu He Tyr Ala Gly Ser Asp Phe 
930 935 940 
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att att gtc cct tea ate ttt gaa ccc tgt ggc tta aca caa ctt gtt 2880 
lie He Val Pro Ser He Phe Glu Pro Cys Gly Leu Thr Gin Leu Val 
945 950 955 960 

gcc atg cgt tat gga teg ate cct ata gtt egg aaa ace gga gga ett 2928 
Ala Met Arg Tyr Gly Ser He Pro He Val Arg Lys Thr Gly Gly Leu 
965 970 975 

tac gae act gtc tte gae gta gae aat gat aag gac egg get egg tet 2976 
Tyr Asp Thr Val Phe Asp Val Asp Asn Asp Lys Asp Arg Ala Arg Ser 
980 985 990 

ctt ggt ctt gaa cca aat ggg ttc agt ttc gac gga gcc gac age aat 3024 
Leu Gly Leu Glu Pro Asn Gly Phe Ser Phe Asp Gly Ala Asp Ser Asn 
995 1000 1005 

ggc gtg gat tat gcc etc aae aga gea ate ggc get tgg ttc gat gee 3072 
Gly Val Asp Tyr Ala Leu Asn Arg Ala He Gly Ala Trp Phe Asp Ala 
1010 1015 1020 

cgt gat tgg ttc cac tec ctg tgt aag agg gtc atg gag caa gac tgg 3120 
Arg Asp Trp Phe His Ser Leu Cys Lys Arg Val Met Glu Gin Asp Trp 
1025 1030 1035 1040 

teg tgg aae egg cct gca ctg gae tac att gaa ttg tac cat gee get 3168 
Ser Trp Asn Arg Pro Ala Leu Asp Tyr He Glu Leu Tyr His Ala Ala 
1045 1050 1055 

cga aaa ttc tga cacccaactg aaccaatgge aagaacaage gcattgtggg 3220 
Arg Lys Phe 





1060 












atcgaetaca 


gtcatacagg 


gctgtgcaga 


tegtcttget 


tcagttagtg 


ecctcttcag 


3280 


ttagttccaa 


gcgeactaca 


gtcgtacata 


gctgaggatc 


ctettgeetc 


cteeaceagg 


3340 


ggaaacaaag 


cagaaatgca 


taagtgeatt 


gggaagactt 


ttatgtatat 


tgttaaattt 


3400 


ttccttttct 


tttecttcec 


tgcaectgga 


aatggttaag 


cgcategeeg 


agataagaae 


3460 


cacagtaaea 


ttctgtgagt 


agctttgtat 


attetctcat 


cttgtgaaaa 


ctaatgtgea 


3520 


tgttaggctc 


tetgateatg 


tggaagcttt 


gttatatgtt 


acttatggtt 


atatggtata 


3580 


cateaatgat 


atttacattt 


gtggaaaaaa 


aaaaaaaaaa 


a 




3621 



<210> 10 
<211> 1059 
<212> PRT 

<213> Triticum aestivum 
<400> 10 

Asp Ala Leu Tyr Val Asn Gly Leu Glu Ala Lys Glu Gly Asp His Thr 
15 10 15 

Ser Glu Lys Thr Asp Glu Asp Ala Leu His Val Lys Phe Asn Val Asp 

20 25 30 

Asn Val Leu Arg Lys His Gin Ala Asp Arg Thr Gin Ala Val Glu Lys 
35 40 45 

Lys Thr Trp Lys Lys Val Asp Glu Glu His Leu Tyr Met Thr Glu His 
50 55 60 
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Gin Lys Arg Ala Ala Glu Gly Gin Met Val Val Asn Glu Asp Glu Leu 
65 70 75 80 

Ser lie Thr Glu lie Gly Met Gly Arg Gly Asp Lys lie Gin His Val 
85 90 95 

Leu Ser Glu Glu Glu Leu Ser Trp Ser Glu Asp Glu Val Gin Leu lie 
100 105 110 

Glu Asp Asp Gly Gin Tyr Glu Val Asp Glu Thr Ser Val Ser Val Asn 
115 120 125 

Val Glu Gin Asp lie Gin Gly Ser Pro Gin Asp Val Val Asp Pro Gin 
130 135 140 

Ala Leu Lys Val Met Leu Gin Glu Leu Ala Glu Lys Asn Tyr Ser Met 
145 150 155 160 

Arg Asn Lys Leu Phe Val Phe Pro Glu Val Val Lys Ala Asp Ser Val 

165 170 175 

lie Asp Leu Tyr Leu Asn Arg Asp Leu Thr Ala Leu Ala Asn Glu Pro 
180 185 190 

Asp Val Val lie Lys Gly Ala Phe Asn Gly Trp Lys Trp Arg Leu Phe 
195 200 205 

Thr Glu Arg Leu His Lys Ser Asp Leu Gly Gly Val Trp Trp Ser Cys 
210 215 220 

Lys Leu Tyr lie Pro Lys Glu Ala Tyr Arg Leu Asp Phe Val Phe Phe 
225 230 235 240 

Asn Gly Arg Thr Val Tyr Glu Asn Asn Gly Asn Asn Asp Phe Cys lie 

245 250 255 

Gly lie Glu Gly Thr Met Asn Glu Asp Leu Phe Glu Asp Phe Leu Val 
260 265 270 

Lys Glu Lys Gin Arg Glu Leu Glu Lys Leu Ala Met Glu Glu Ala Glu 
275 280 285 

Arg Arg Thr Gin Thr Glu Glu Gin Arg Arg Arg Lys Glu Ala Arg Ala 
290 295' 300 

Ala Asp Glu Ala Val Arg Ala Gin Ala Lys Ala Glu lie Glu He Lys 
305 310 315 320 

Lys Lys Lys Leu Gin Ser Met Leu Ser Leu Ala Arg Thr Cys Val Asp 

325 330 335 

Asn Leu Trp Tyr He Glu Ala Ser Thr Asp Thr Arg Gly Asp Thr He 
340 345 350 

Arg Leu Tyr Tyr Asn Arg Asn Ser Arg Pro Leu Ala His Ser Thr Glu 
355 360 365 

He Trp Met His Gly Gly Tyr Asn Asn Trp Ser Asp Gly Leu Ser He 
370 375 380 

Val Glu Ser Phe Val Lys Cys Asn Asp Lys Asp Gly Asp Trp Trp Tyr 
385 390 395 400 



Ala Asp Val He Pro Pro Glu Lys Ala Leu Val Leu Asp Trp Val Phe 
405 410 415 
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Ala Asp Gly Pro Ala Gly Asn Ala Arg Asn Tyr Asp Asn Asn Ala Arg 
420 425 430 

Gin Asp Phe His Ala He Leu Pro Asn Asn Asn Val Thr Glu Glu Gly 
435 440 445 

Phe Trp Ala Gin Glu Glu Gin Asn He Tyr Thr Arg Leu Leu Gin Glu 
450 455 460 

Arg Arg Glu Lys Glu Glu Thr Met Lys Arg Lys Ala Glu Arg Ser Ala 
465 470 475 480 

Asn He Lys Ala Glu Met Lys Ala Lys Thr Met Arg Arg Phe Leu Leu 
485 490 495 

Ser Gin Lys His He Val Tyr Thr Arg Thr Xaa Leu Lys Tyr Val Pro 
500 505 510 

Gly Thr Thr Val Asp Val Leu Tyr Asn Pro Ser Asn Thr Val Leu Asn 
515 520 525 

Gly Lys Ser Glu Gly Trp Phe Arg Cys Ser Phe Asn Leu Trp Met His 
530 535 540 

Ser Ser Gly Ala Leu Pro Pro Gin Lys Met Val Lys Ser Gly Asp Gly 
545 550 555 560 

Pro Leu Leu Lys Ala Thr Val Asp Val Pro Pro Asp Ala Tyr Met Met 
565 570 575 

Asp Phe Val Phe Ser Glu Trp Glu Glu Asp Gly He Tyr Asp Asn Arg 
580 585 590 

Asn Gly Met Asp Tyr His He Pro Val Ser Asp Ser He Glu Thr Glu 
595 600 605 

Asn Tyr Met Arg He He His He Ala Val Glu Met Ala Pro Val Ala 
610 615 620 

Lys Val Gly Gly Leu Gly Asp Val Val Thr Ser Leu Ser Arg Ala He 
625 630 635 640 

Gin Asp Leu Gly His Thr Val Glu Val He Leu Pro Lys Tyr Asp Cys 
645 650 655 

Leu Asn Gin Ser Ser Val Lys Asp Leu His Leu Tyr Gin Ser Phe Ser 
660 665 670 

Trp Gly Gly Thr Glu He Lys Val Trp Val Gly Arg Val Glu Asp Leu 
675 680 685 

Thr Val Tyr Phe Leu Glu Pro Gin Asn Gly Met Phe Gly Val Gly Cys 
690 695 700 

Val Tyr Gly Arg Asn Asp Asp Arg Arg Phe Gly Phe Phe Cys His Ser 
705 710 715 720 

Ala Leu Glu Phe He Leu Gin Asn Glu Phe Ser Pro His He He His 
725 730 735 

Cys His Asp Trp Ser Ser Ala Pro Val Ala Trp Leu Tyr Lys Glu His 

740 745 750 



Tyr Ser Gin Ser Arg Met Ala Ser Thr Arg Val Val Phe Thr He His 
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755 760 765 

Asn Leu Glu Phe Gly Ala His Tyr lie Gly Lys Ala Met Thr Tyr Cys 
770 775 780 

Asp Lys Ala Thr Thr Val Ser Pro Thr Tyr Ser Arg Asp Val Ala Gly 
785 790 795 800 

His Gly Ala lie Ala Pro His Arg Glu Lys Phe Tyr Gly lie Leu Asn 
805 810 815 

Gly lie Asp Pro Asp lie Trp Asp Pro Tyr Thr Asp Asn Phe lie Pro 
820 825 830 

Val Pro Tyr Thr Cys Glu Asn Val Val Glu Gly Lys Arg Ala Ala Lys 

835 840 845 

Arg Ala Leu Gin Gin Lys Phe Gly Leu Gin Gin Thr Asp Val Pro lie 
850 855 860 

Val Gly lie He Thr Arg Leu Thr Ala Gin Lys Gly He His Leu He 
865 870 875 880 

Lys His Ala He His Arg Thr Leu Glu Ser Asn Gly Gin Val Val Leu 
885 890 895 

Leu Gly Ser Ala Pro Asp His Arg He Gin Gly Asp Phe Cys Arg Leu 
900 905 910 

Ala Asp Ala Leu His Gly Val Tyr His Gly Arg Val Lys Leu Val Leu 
915 920 925 

Thr Tyr Asp Glu Pro Leu Ser His Leu He Tyr Ala Gly Ser Asp Phe 
930 935 940 

He He Val Pro Ser He Phe Glu Pro Cys Gly Leu Thr Gin Leu Val 
945 950 955 960 

Ala Met Arg Tyr Gly Ser He Pro He Val Arg Lys Thr Gly Gly Leu 
965 970 975 

Tyr Asp Thr Val Phe Asp Val Asp Asn Asp Lys Asp Arg Ala Arg Ser 

980 985 990 

Leu Gly Leu Glu Pro Asn Gly Phe Ser Phe Asp Gly Ala Asp Ser Asn 
995 1000 1005 

Gly Val Asp Tyr Ala Leu Asn Arg Ala He Gly Ala Trp Phe Asp Ala 
1010 1015 1020 

Arg Asp Trp Phe His Ser Leu Cys Lys Arg Val Met Glu Gin Asp Trp 
025 1030 1035 1040 

Ser Trp Asn Arg Pro Ala Leu Asp Tyr He Glu Leu Tyr His Ala Ala 
1045 1050 1055 

Arg Lys Phe 



<210> 11 

<211> 728 

<212> DNA 

<213> Triticum sp. 



<400> 11 
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gatcttgaac 


ggcacgtgaa 


agacttgtaa 


caacatcccc 


gagacctcca 


acctatgaga 


60 


tcatcgatca 


tgacagagca 


tagtattatg 


gcatagaatg 


aaaaaaaggc 


ataaggtgat 


120 


gagatctcca 


cgccagagcg 


ttgtattcca 


attttagttc 


tttccccgtg 


aggaggggag 


180 


gctaggcggg 


cgaggcagag 


gggatagggc 


agtcgccgct 


gcgtggtgga 


ctgactggtg 


240 


tggtgggtgg 


tgggttttgc 


gggcggggtt 


tagtaggttc 


ccggaaatgg 


agatggctct 


300 


ccggccacgg 


agccctctgt 


gccctcggag 


cagtcagccg 


ctcgtcgtcg 


tccggccggc 


360 


cggccgcggc 


ggcggcctcg 


cgcaggtacg ggtgattatg 


gttcttgatt 


cggtcggttc 


420 


acggaatgtt 


gtttgatttg 


gttctgtccc 


gggtcaggtt 


catagtgatt 


ttattccgca 


480 


aaaaaaaaag 


gtttatagtg 


attttgattt 


ctttcatctc 


gggaacattt 


ttatatctgg 


540 


gagtcaaagg 


gcattggttt 


tgatttgcat 


gcggaacata 


ttggttattt 


attaatgtgg 


600 


tgagctggaa 


ttcatactgc 


ttaaaacgac 


gtgattttaa 


ttgctggaag 


aggtaaagaa 


660 


catgaattct 


tgttatattt 


gttaaaaaaa 


atcccctgtt 


ctagcgtttc 


aatctgcatg 


720 


atcatgga 












728 


<210> 12 

<211> 2446 

<212> DNA 

<213> Triticura sp. 












<400> 12 
gtgggtctat 


aaaagacagg 


tttgagcgga 


ttcgtcagga 


aatgtttcaa 


caagtgcgac 


60 


gatgtgggat 


gcaattgatg 


aaaccgtggc 


ttgatcaaga 


cgcagttgag 


gcggatttgt 


120 


cgggaaatgc 


ttcaagctgc 


gcgacataca 


gagaagtgga 


tgatgtggtg 


gatgaaacta 


180 


gatcagaaga 


ggaaacattt 


gcgatggatt 


tgtttgcaag 


tgaatcaggc 


catgagaaac 


240 


atatggcagt 


ggatcatgtg ggtgaagcta 


ccgatgaaga 


agagacttac 


caacagcaat 


300 


atccagtacc 


gtcttcattc 


tctatgtggg 


acaaggctat 


tgctaaaaca 


ggtgtaagtt 


360 


tgaatcctga 


gctgcgactt 


gtcagggttg 


aagaacaagg 


caaagtaaat 


tttagtgata 


420 


aaaaagacct 


gtcaattgat 


gatttaccag 


gacaaaacca 


atcgatcatt 


ggttcctata 


480 


aacaagataa 


atcaattgct 


gatgttgcgg 


gaccgaccca 


atcaatttct 


ggttctagta 


540 


aacaacaccg 


gtcaattgtt 


gctttcccca 


aacaaaacca 


gtcaattgtt 


agtgtcactg 


600 


agcaaaagca 


gtccatagtt 


ggattccgta 


gtcaagatct 


ttcggctgtt 


agtctcccta 


660 


aacaaaacgt 


accaattgtt 


ggtacgtcga 


gagagggtca 


aacaaagcaa 


gttcctgttg 


720 


ttgatagaca 


ggatgcgttg 


tatgtgaatg 


gactggaagc 


taaggaggga 


gatcacacat 


780 


ccgagaaaac 


cgatgaggat 


gtgcttcatg 


taaaatttaa 


tgttgacaat 


gtgttgcgga 


840 


agcatcaggc 


agatagaacc 


caagcagtgg 


aaacgataac 


ttggaagaaa 


gttgatgagg 


900 


aacatcttta 


catgactgaa 


catcagatag 


gtgctgccga 


aggacagatg 


gtagttaacg 


960 



P:\OPERWIR0\PI-WSS.PRV - 29t* 



-38- 



aggatgagct 


ttctataact 


gaaattggaa 


tggggagagg 


tgataaaatt 


cagcatgtgc 


1020 


tttctgagga 


agagctttca 


tggtctgaag 


atgaagtgca 


gttaattgag 


gatgatggac 


1080 


aatatgaagt 


tgatgagacc 


tctgtgtccg 


ttaacgttga 


acaagatatc 


caggggtcac 


1140 


cacaggatgt 


tgtggatccg 


caagcactaa 


aggtgatgct 


gcaagaactc 


gctgagaaaa 


1200 


attattcgat 


gaggaacaag 


ctgtttgttt 


ttccagaggt 


agtgaaagct 


gattcagtta 


1260 


ttgatcttta 


tttcaatcgt 


gacctaacag 


ctttggcgaa 


tgaacccgat 


gttgtcatca 


1320 


aaggagcatt 


caatggttgg 


aaatggaggc 


ttttcactga 


aagattgcat 


aagagtgacc 


1380 


ttggaggggt 


ttggtggtct 


tgcaaactgt 


acatacccaa 


ggaggcctac 


agattagact 


1440 


ttgtgttctt 


caacggtcgc 


acggtctatg 


agaacaatgg 


caacaatgat 


ttctgtatag 


1500 


gaatagaagg 


cactatgaat 


gaagatctgt 


ttgaggattt 


cttggttaaa 


gaaaagcaaa 


1560 


gggagcttga 


gaaacttgcc 


atggaagaag 


ctgaaaggag 


gacacagact 


gaagaacagc 


1620 


ggcgaagtaa 


ggaagcaagg 


gctgcagatg 


aagctgtcag 


ggcacaagcg 


aaggccgaga 


1680 


tagagatcaa 


gaacaaaaaa 


ttgcagagta 


tgttgagttt 


ggccagaaca 


tgtgttgata 


1740 


atttgtggta 


catagaggct 


agcacagata 


caagcggaga 


tactatcagg 


ttatactata 


1800 


acagaaactc 


gaggccactt 


gcgcatagta 


ctgagatttg 


gatgcatggt 


ggttacaaca 


1860 


attggtcaga 


tggactctct 


attgttgaaa 


gctttgtcaa 


gtgcaatgac 


agagacggcg 


1920 


attggtggta 


tgcagatggt 


acgacacctc 


aacctttgta 


cataaggcaa 


cattgttttg 


1980 


attttttttg 


ttgaggaaac 


atttgttttg 


attctagcat 


aatgctccta 


caaatatggc 


2040 


augaauuccc 


cugui^u uacu 


gaugi.cai.ga 


gciaeig u ci u u u 


tattaactcg 


aaggccatgg 


"51 nn 
^ xuu 


aagctcaaca 


tttaccatag 


acagacgctt 


aaagatcatt 


tgtattccgt 


ggatcatata 


2160 


tgtaatgtaa 


tacctgtctt 


ttctctatat 


gtacagttat 


tccacctgaa 


aaagcacttg 


2220 


tgttggactg 


ggtttttgct 


gatgggccag 


ctgggaatgc 


aaggaactat 


gacaacaatg 


2280 


ctcgacaaga 


tttccatgct 


attcttccaa 


acaacaatgt 


aaccgaggaa ggcttctggg 


2340 


tgcaagagga 


gcaaaacatc 


tatacaaggc 


ttctgcaaga 


aaggagagaa 


aaggaagaaa 


2400 


ccatgaaaag 


aaaggtgagt 


tgcaacaaaa 


tctttgcata 


t agate 




2446 


<210> 13 

<211> 1032 

<212> DNA 

<213> Triticum sp. 












<400> 13 
gatctctata 


attttggcag 


ttaacccctg 


agtgatggca 


aatatattcc 


ctttcgtcta 


60 


ttttccaaat 


tcaaaatgca 


tggttccatg 


caagcttatc 


caaaatcact 


tgataatata 


120 


ccaatcacaa 


cataactttg 


tttaccataa 


gaacattcct 


acttaaaatt 


tgcaaggtaa 


180 


ctccctttcg 


aggctggttg 


gcttgatgag 


taactggcaa 


ttaacaaaga 


aaagatatat 


240 



* « 
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ctgatgtttg 


gaacaaaaca 


tatgatcagg 


gttgtttggg 


ttgactcatg 


ttccttttta 


300 


cctacacagg 


ctgagagaag 


tgcaaatatc 


aaagctgaga 


tgaaggcaaa 


aactatgcga 


360 


aggtttctgc 


tttcccagaa 


acacattgtt 


tataccgaac 


cgcttgaaat 


acgtgccgga 


420 


accacagtgg 


atgtgctata 


caatccctct 


aacacagtgc 


taaatggaaa 


gccggaggtt 


480 


tggtttagat 


gctcttttaa 


cctttggatg 


catccaagtg 


gagcattgcc 


accccagaag 


540 


atggtgaaat 


caggggatgg 


gccgctctta 


aaagccacag 


gtttattgcg 


ttattacatc 


600 


actgttatta 


gtatatatat 


aaccattttt 


atgcaatcaa 


tagagtcaag 


tgcaactaac 


660 


gatgcacaga 


taggatcaca 


tcattaggag 


aatgatgtga 


tggacaagac 


ccaatcctaa 


720 


gcatagcaca 


agatcgtgta 


gttcgttcgc 


tagagctttt 


ctaatgtcaa 


gtatcatttc 


780 


cttagaccat 


gagattgtgc 


aactcccgga 


tatcgtagga 


gtgctttggg 


tgtatcaaat 


840 


gtcacaacgt 


aactgggtga 


ctataaaggt 


gcactacagg 


tatctccgaa 


agtttctgtt 


900 


gggttggcac 


gaatcgagac 


tgggatttgt 


cactccgtat 


gacggagagg 


tatctttggg 


960 


cccactcggt 


aatgcatcat 


cataatgagc 


tcaatgtgac 


taaggagtta 


gccacgggat 


1020 


cgagaattcc 


eg 










1032 



<210> 14 

<211> 892 

<212> DNA 

<213> Triticum sp. 

<400> 14 

aatatttctt gttctattat tggtaataat tagctagttt aatgccataa gcccataaca 60 
gatatgcaac tactccctcc aatccatatt acttgtcgca actttggtac aactttagta 120 
caaagttata ctaaagctgt gacaagtaat atggaccgga gggagtacta tataagcttg 180 
tagctgtttt gagaccgagt gtctgctcgg gtggctagct ggagcgggct gaagtgcttg 24 0 
caggcacctc ttctctaaaa aaaagtgctt gcagcccccc cgccccctcc atagggtgag 300 
tggtcacctt tcttcttaaa aattatggca ccaagggaaa ttctcggctg gtcgagcttg 360 
tagctatttt ttcggagcgt gaatgggagc gtctttctgt ataaggccta taggcttact 420 
ttgatatata ttgtgaagtc acttaagcct tgttaaaacg tagaaactta gttccgcaac 480 
ttggccaaat ccctgttaaa ttggtttact gtgtactaga tgcatcgatg gcgcagagtc 540 
ccggggggta ataaagcttc cattttctac aatgaagtta attatcctac ttgccttgta 600 
attactgagt acaatacaga gcaccgaaaa gctgtatcct tcctacttcc ttatgtttat 660 
ctgtgttcct tgtctagtta atgttccacc ggatgcctat atgatggact ttgttttctc 720 
cgagtgggaa gaagatggga tctatgacaa caggaatggg atggactatc atattcctgt 780 
ttctgattca attgaaacag agaattacat gcgtattatc cacattgccg ttgagatggc 840 
ccccgttgca aaggtaatat aattctaagg ctagtttctt tgatgcgagg eg 892 
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<210> 15 

<211> 871 

<212> DNA 

<213> Triticum sp. 












<400> 15 
aggttatcct 


ccagaatgaa 


ttttttccag 


tacgtattat 


ttagaatact 


agcggtatat 


60 


tgactttttc 


tttgtgagac 


tacactttct 


tgtttaccat 


tccagtgcac 


catgttcaaa 


120 


atcttgtatt 


cagcgcgtta 


ctttcagttt 


ctttactact 


agcttatttg 


gtgcattggt 


180 


gtttcctttc 


ctactctact 


atctgaatgc 


tacttgtgtt 


ttcgcaacag 


ttgcttcttt 


240 


atccccttcc 


atttctcagt 


taaaaaaact 


tgcatctgta 


ttcacgtgac 


agcatataat 


300 


acattgccat 


gattggtcaa 


gtgctccggt 


cgcctggcta 


tataaggaac 


actattccca 


360 


atccagaatg 


gcaagcactc 


gggttgtatt 


taccatccac 


aatcttgaat 


ttggagcaca 


420 


ttatattggt 


aaagcaatga 


catactgtga 


taaagccaca 


actgtgagtg 


ccttactgtc 


480 


ttgtaatttt 


taatctttct 


gtttggcgca 


cagaaaatct 


tccacatttt 


acagaatcat 


540 


gttcttgtgt 


tttgtacgta 


ttcaactatt 


tccacccaaa 


cttttcaggt 


ttctcctaca 


600 


tattcaaggg 


acgtggcagg 


ccatggtgcc 


attgctcctc 


atcgtgagaa 


attctacggc 


660 


attctcaatg 


gaattgatcc 


agatatctgg 


gatcctgatt 


gccaacatgc 


tgtttggtcg 


720 


tctcgaggtc 


tttacattgc 


tggtgctctt 


taccccgact 


ttctggcgtg 


aatgatggag 


780 


taatacgtga 


aaacattaat 


tcttttctca 


acaagggacg 


gacaaacgcg 


cgagattgcc 


840 


tcctacctgg 


cttcggaact 


gaaagaactg 


9 






871 



<210> 16 

<211> 1592 

<212> DNA 

<213> Triticum sp. 

<400> 16 



cgggaattct 


cgatcccgtg 


gctaactcct 


tagtcacatt 


gagctcatta 


tgatgatgca 


60 


ttaccgagtg 


ggcccaaaga 


tacctctccg 


tcatacggag 


tgacaaatcc 


cagtctcgat 


120 


tcgtgccaac 


ccaacagaaa 


ctttcggaga 


tacctgtagt 


gcacctttat 


agtcacccag 


180 


ttacgttgtg 


acatttgata 


cacccaaagc 


actcctacga 


tatccgggag 


ttgcacaatc 


240 


tcatggtcta 


aggaaatgat 


acttgacatt 


agaaaagctc 


tagcgaacga 


actacacgat 


300 


cttgtgctat 


gcttaggatt 


gggtcttgtc 


catcacatca 


ttctcctaat 


gatgtgatcc 


360 


atacactgac 


aattttatcc 


cggtaccaga 


ttttttccca 


gagtgcaagt 


agatatatac 


420 


caaggccaca 


gatagtttta 


tgcttaacta 


tgtgtttcat 


actacttcag 


gtcccttata 


480 


cttgtgagaa 


tgttgtcgaa 


ggcaagagag 


ctgcaaaaag 


ggccttgcag 


cagaagtttg 


540 


gattacagca 


aactgatgtc 


cctattgtcg 


gaatcatcac 


ccgtctgaca 


gcccagaagg 


600 


gaatccacct 


catcaagcac 


gcaattcacc 


gaaccctcga 


aagcaacgga 


caggttcatc 


660 
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atcccttgtg 


aacgaataaa 


catcaaacgt 


tttgtttata 


aaaagttgct 


tactatttgt 


720 


ttttgtttac 


ttcaaaacaa 


aagtctgaaa 


atgaagtgtt 


tggttcctag 


gtggttttgc 


780 


ttggttcagc 


tccagatcat 


cgaatacaag 


gcgatttttg 


cagattggcc 


gatgctcttc 


840 


acggtgttta 


ccacggtagg 


gtgaagcttg 


ttctaaccta 


cgatgagcct 


ctttctcacc 


900 


tggtgagctc 


caatatccta 


cacaccatct 


agccagccct 


tcattatggg 


agctggagac 


960 


tactttataa 


tttaggttga 


tgatcgatca 


tgctgcagat 


atacgctggc 


tccgacttca 


1020 


ttattgtccc 


ttcaatcttc 


gaaccctgtg 


gcttaacaca 


acttgttgcc 


atgcgttatg 


1080 


gatcgatccc 


tatagttcgg 


aaaaccggag 


gtgtgtgact 


atttctctcc 


attatgctgc 


1140 


actgatttgc 


atatgtcgag 


ctgttggaca 


tgaaatggaa 


actatccttt 


ggtatcgcag 


1200 


gactttacga 


cactgtcttc 


gacgtagaca 


atgataagga 


ccgggctcgg 


tctcttggtc 


1260 


ttgaaccaaa 


tgggttcagt 


ttcgacggag 


ccgacagcaa 


cggcgtggat 


tatgccctca 


1320 


acaggcaagt 


atcgttcctc 


aattagccct 


gaattcagca 


gtagtgctag 


gttatttacc 


1380 


ttgcatgttc 


catacctcat 


ttcagagcaa 


tcggcgcttg 


gttcgatgcc 


cgtgattggt 


1440 


tccactccct 


gtgtaagagg 


gtcatggaac 


aagactggtc 


atggaaccgg 


cccgcactgg 


1500 


actacattga 


attgtaccat 


gccgctcgaa 


aattctgaca 


cccaactgaa 


ccaatggcaa 


1560 


gaacaagcgc 


attgtgggat 


cgagaattcc 


eg 






1592 



<210> 17 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<:223> Description of Artificial Sequence : PEPTIDE MOTIF 
<400> 17 

Asp Val Gin Leu Val Met Leu Gly Thr Gly 
15 10 



<210> 18 
<211> 10 
<:212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE MOTIF 
<400> 18 

Ala Ala Gly Lys Lys Asp Ala Gly lie Asp 
15 10 



<210> 19 
<211> 10 
<212> PRT 

<213> Artificial Sequence 



PAOPER\MRO\PI-WSS.PRV - 29A 



-42- 



<220> 

<223> Description of Artificial Sequence : PEPTIDE MOTIF 
<400> 19 

Ala Thr Gly Lys Lys Asp Ala Gly lie Asp 
15 10 



<210> 20 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE MOTIF 
<400> 20 

Ala Leu Gly Lys Lys Asp Ala Gly lie Asp 
15 10 



<210> 21 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE MOTIF 

<400> 21 

Ala Thr Gly Lys Lys Asp Ala Leu 
1 5 



<210> 22 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE MOTIF 
<400> 22 

Ala Leu Gly Lys Lys Asp Ala Leu 
1 5 



<210> 23 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE MOTIF 
<400> 23 

Ala Ala Gly Lys Lys Asp Ala Arg Val Asp Asp Ala Ala 
1 5 . 10 



<210> 24 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence : PEPTIDE MOTIF 
<400> 24 

Ala Leu Gly Lys Lys Asp Ala Gly lie Val Asp Gly Ala 
15 10 



<210> 25 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 

<400> 25 

tgttgaggtt ccatggcacg ttc 23 



<210> 26 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 26 

agtcgttctg ccgtatgatg teg 23 



<210> 27 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<:220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 27 

ccaagtacca gtggtgaacg c 21 



<210> 28 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 28 

cggtgggatc caacggccc 19 



<210> 29 

<211> 20 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence : PRIMER 



<400> 29 
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ggaggtcttg gtgatgttgt 



<210> 30 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 30 

cttgaccaat catggcaatg 



<210> 31 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 31 

cattgccatg attggtcaag 



<210> 32 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 32 

accacctgtc cgttccgttg c 



<210> 33 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 33 

gcacggtcta tgagaacaat ggc 



<210> 34 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence ; PRIMER 
<400> 34 

tctgcatacc accaatcgcc g 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
<400> 35 

Lys Val Gly Gly Leu Gly Asp Val 
1 5 

Gin Asp Leu Gly His Asn Val Glu 
20 



Sequence : PEPTIDE MOTIF 



Val Thr Ser Leu Ser Arg Ala Val 
10 15 

Val 
25 



<210> 36 
<211> 25 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE MOTIF 
<400> 36 

Lys Val Gly Gly Leu Gly Asp Val Val Thr Ser Leu Ser Arg Ala He 

Gin Asp Leu Gly His Thr Val Glu Val 
20 25 
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1 50 
wSSIIB ATTTCCTCGG CCTGACCCTG TGCGTTTACC CCACACAGAG CACACTCCAG 

;SSIID 

wSSIIA 

51 100 
wSSIIB TCCAGTCCAG CCCACTGCCG CGCTACTCCC CACTCCCACT GCCACCACCT 

wSSIID 

wSSIIA GCT GCCACCACCT 



101 150 
wSSIIB CCGCcTGCGC CGCGCTCTGG GCGGACCAAC CCGCGCATCG TATCACGATC 

wSSIID 

wSSIIA CCGCCTGCGC CGCGCTCTGG GCGGAGGACC AACCCGCGCA TCGTACCATC 



151 200 
wSSIIB ACCCACCCCG ATCCCGGCCG CCGCCATGTC GTCGGCGGTC GCGTCCGCCG 

WSSIID 

wSSIIA GCCCGCCCCG ATCCCGGCCG CCGCCATGTC GTCGGCGGTC GCGTCCGCCG 



201 250 
wSSIIB CGTCCTTCCT CGCGCTCGCG TCCGCCTCCC CCGGGAGATC ACGGAGGAGG 

WSSIID 

WSSIIA CGTCCTTCCT CGCGCTCGCC TCCGCCTCCC CCGGGAGATC ACGCAGGCGG 

251 300 
WSSIIB ACGAGGGTGA GCGCGTCGCC ACCCCACACC GGGGCTGGCA GGTTGCACTG 

wSSIID 

WSSIIA GCGAGGGTGA GCGCGCCGCC ACCCCACGCC GGGGCCGGCA GGCTGCACTG 



301 350 
WSSIIB GCCGCCGTCG CCGCCGCAGC GCACGGCTCG CGACGGAGCG GTGGCCGCGC 

WSSIID 

wSSIIA GCCGCCGTGG CCGCCGCAGC GCACGGCTCG CGACGGAGGT GTGGCCGCGC 



351 400 
WSSIIB GCGCCGCCGG GAAGAAGGAC GCGGGGAT.. . CGACGACGC CGCGCCCGCG 

wSSIID 

WSSIIA GCGCCGCCGG GAAGAAGGAC GCGAGGGTCG ACGACGACGC CGCGTCCGCG 



401 450 
WSSIIB AGGCAGCCCC GCGCACTCCG CGGTGGCGCC GCCACCAAGG TTGCGGAGCG 

WSSIID 

WSSIIA AGGCAGCCCC GCGCACGCCG CGGTGGCGCC GcCACCAAGG TCGCGGAGCG 



451 500 
WSSIIB GAGGGATCCC GTCAAGACGC TCGATCGCGA CGCCGCGGAA GGTGGCGCGC 

WSSIID 

wSSIIA GAGGGATCCC GTCAAGACGC TCGATCGCGA CGCCGCGGAA GGTGGCGCGC 



501 550 
wSSIIB CGTCCCCGCC GGCACCGAGG CAGGAGGACG CCCGTCTGCC GAGCATGAAC 

wSSIID 

wSSIIA CGGCACCGCC GGCACCGAGG CAGGACGCCG CCCGTCCaCC GAGTATGAAC 



FIGURE 2-1 
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551 600 
GGCATGCCGG TGAACGGTGA AAACAAATCT ACCGGCGGCG GCGGCGCGAC 

GGCACGCCGG TGAACGGTGA GAACAAATCT ACCGGCGGCG GCGGCGCGAC 



wSSIIB 
WSSIID 
wSSIIA 



wSSIIB 
wSSIID 
WSS I lA 



wSSIIB 
wSSIID 
WSSIIA 



601 

TAAAGACAGC 



cAAAGACAGC 
651 

AGAACAGAGT 



AgAACAgAGT 
701 

CCGACGAGCA 



CCGACGAGCA 



GGGCTGCCCG 
GGGCTgcCCG 

ACCGGTGAAT 
ACCAGTGAAC 

TAGCCGAGGT 
TAGCCGAGGT 



CACCCGCACG 
CACCCGcACG 

GGTGAAAACA 
GGTGAAAACA 

CGCGGCTCCG 
CGTGGCTCCG 



CGCGCCCCAG 
CGCGCCCCAT 

AAGCTAACGT 
AAGCTAACGT 

GATCCCGCAG 
GATTCCGCAG 



650 

CCGTCGAGCC 



cCGTCGAcCC 
700 

CGCCTCGCCG 



CGCCTCGCCG 
750 

CTACCATTTC 



CTACCATTTC 



751 800 
wSSIIB CATCAGTGAC AAGGCGCCAG AGTCCGTTGT CCCAGCCGAG AAGGcgccgc 

wSSIID "-CCAGCTGAG AAGACGCCGC 

wSSIIA CATCAGTGAC 7UVGGCGCCGG AGTCCGTTGT CCCAGCCGAG AAGCCGCCGC 

801 850 
WSSIIB CGtCgtcCgg CtcAAATtTc gtgCcCtCgg cttctGctCc cggGtctGAC 
wSSIID CGTCGTCCGG CTCAAATTTC GAGTCCTCGG CCTCTGCTCC CGGGTCTGAC 
WSSIIA CGTCGTCCGG CTCAAATTTC GTGgTCTCGG CTTCTGCTCC CAGGCTGGAC 

851 900 
WSSIIB actgtCaGCG acGtGGaact TgaActGAAg aAGGGtgCgg tCattgTcaA 
WSSIID ACTGTCAGCG ACGTGGAACA AGAACTGAAG AAGGGTGCGG TCGTTGTCGA 
wSSIIA ATTGACAGCG ATGTTGAACC TGAACTGAAG AAGGGTGCGG TCATCGTCGA 

901 950 
wSSIIB aGAAgcTcCa aaCcCaAaGG CTCTTTCGCC GCCCGCAGCA CCCGCTGTAC 
wSSIID AGAAGCTCCA AAGCCAAAGG CTCTTTCGCC GCCtGCAGCc CCCGCTGTAC 
wSSIIA AGAAGCTCCA AACCCAAAGG CTCTTTCGCC GCCTGCAGCC CCCGCTGTAC 

951 1000 
wSSIIB AACAAGACCT TTGGGACTTC AAGAAATACA TTGGTTTCGA GGAGCCCGTG 
wSSIID AAgAAGACCT TTGGGAtTTC AAGAAATACA TTGGTTTCGA GGAGCCCGTG 
wSSIIA AAGAAGACCT TTGGGACTTC AAGAAATACA TTGGCTTCGA GGAGCCCGTG 

1001 1050 

wSSIIB GAGGCCAAGG ATGATGGCCG GGCTGTTGCA GATGATGCGG GCTCCTTCGA 

WSSIID GAGGCCAAGG ATGATGGCCG GGCTGTcGCA GATGATGCGG GCTCCTTtGA 

WSSIIA GAGGCCAAGG ATGATGGCTG GGCTGTTGCA GATGATGCGG GCTCCTTTGA 



1051 1100 
wSSIIB ACACCACCAG AATCACGATT CCGGGCCTTT GGCAGGGGAG AACGTCATGA 
WSSIID ACACCACCAG AATCACGAcT CCGGaCCTTT GGCAGGGGAG AAtGTCATGA 
wSSIIA ACATCACCAG AACCATGATT CCGGACCTTT GGCAGGGGAG AACGTCATGA 
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1101 

wSSIIB ACGTGGTCGT CGTGGCTGCT 
wSSIID ACGTGGTCGT CGTGGCTGCT 
wSSIIA ACGTGGTCGT CGTGGCTGCT 
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1150 

GAATGTTCTC CCTGGTGCAA AACAGGTGGT 
GAgTGTTCTC CCTGGTGCAA AACAGGTGGT 
GAATGTTCTC CCTGGTGCAA AACAGGTGGT 



1151 

wSSIIB CTTGGAGATG TTGCCGGTGC 
wSSIID CTgGGAGATG TTGCgGGTGC 
wSSIIA CTTGGAGATG TTGCCGGTGC 

1201 

wSSIIB TCGTGTTATG GTTGTGGTAC 
WSSIID TCGTGTTATG GTTGTGGTAC 
WSSIIA TCGTGTTATG GTTGTGGTAC 



1200 

TTTGCCCAAG GCTTTGGCGA AGAGAGGACA 
TcTGCCCAAG GCTTTGGCaA AGAGAGGACA 
TTTGCCCAAG GCTTTGGCGA AGAGAGGACA 

1250 

CAAGGTATGG GGACTATGAG GAAGCCTACG 
CAAGGTATGG GGACTATGAa GAACCTACGg 
CAAGGTATGG GGACTATGAG GAAGCCTACG 



1251 

WSSIIB ATGTCGGAGT CCGAAAATAC 
WSSIID ATGTCGGAGT CCGAAAATAC 
WSSIIA ATGTCGGAGT CCGAAAATAC 



1300 

TACAAGGCTG CTGGACAGGA TATGGAAGTG 
TACAAGGCTG CTGGACAGGA TATGGAAGTG 
TACAAGGCTG CTGGACAGGA TATGGAAGTG 



1301 

WSSIIB AATTATTTCC 
WSSIID AATTATTTCC 
wSS I lA AATTATTTCC 



ATGCTTATAT CGATGGAGTT 
ATGCTTaTAT CGATGGAGTT 
ATGCTTATAT CGATGGAGTT 



1350 

GATTTTGTGT TCATTGACGC 
GATTTTGTGT TCATTGACGC 
GATTTTGTGT TCATTGACGC 



1351 

wSSIIB TCCTCTCTTC CGACACCGCC 

wSSIID TCCTCTCTTC CGACACCGAG 

wSSIIA TCCTCTCTTC CGACACCGCC 



1400 

AGGAAGACAT TTATGGGGGC AGCAGACAGG 
AGGAAGACAT TTATGGGGGC AGCAGACAGG 
AGGAAGACAT TTATGGGGGC AGCAGACAGG 



1401 

WSSIIB AAATTATGAA GCGCATGATT 
wSSIID AAATTATGAA GCGCATGATT 
wSSIIA AAATTATGAA GCGCATGATT 



1450 

TTGTTCTGCA AGGCCGCTGT CGAGGTTCCA 
TTGTTCTGCA AGGCCGCTGT TGAGGTTCCA 
TTGTTCTGCA AGGCCGCTGT CGAGGTTCCT 



1451 1500 
wSSIIB TGGCACGTTC CATGCGGCGG TGTCCCTTAT GGGGATGGAA ATCTGGTGTT 
wSSIID TGGCACGTTC CATGCGGCGG TGTCCCTTAT GGGGATGGAA ATCTGGTGTT 
WSSIIA TGGCACGTTC CATGCGGCGG TGTCCCTTAT GGGGATGGAA ATCTGGTGTT 



1501 

wSSIIB TATTGCAAAT GATTGGCACA 
WSSIID TATTGCAAAT GATTGGCACA 
WSSIIA TATTGCAAAT GATTGGCACA 



1550 

CGGCACTCCT GCCTGTCTAT CTGAAAGCAT 
CGGCACTCCT GCCTGTCTAT CTGAAAGCAT 
CGGCACTCCT GCCTGTCTAT CTGAAAGCAT 



1551 

WSSIIB ATTACAGGGA CCATGGTTTG 
WSSIID ATTACAGGGA CCATGGTTTG 
wSSIIA ATTACAGGGA CCATGGTTTG 



1600 

ATGCAGTACA CTCGGTCCAT TATGGTGATA 
ATGCAGTACA CTCGGTCCAT TATGGTGATA 
ATGCAGTACA CTCGGTCCAT TATGGTGATA 



1601 

wSSIIB CATAACATCG CTCACCAGGG 
wSSIID CATAACATCG CTCACCAGGG 
wSSIIA CATAACATCG CGCACCAGGG 



1650 

CCGTGGCCCA GTAGATGAGT TCCCGTTCAC 
CCGTGGCCCT GTAGATGAAT TCCCGTTCAC 
CCGTGGCCCA GTAGATGAAT TCCCGTTCAC 
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1651 

-wSSIIB CGAGTTGCCT 
wSSIID CGAGTTGCCT 
WSSIIA CGAGTTGCCT 



GAGCACTACC TGGAACACTT 
GAGCACTACC TGGAACACTT 
GAGCACTACC TGGAACACTT 



1700 

CAGACTGTAC GACCCCGTGG 
CAGACTGTAC GACCCCGTGG 
CAGACTGTAC GACCCCGTGG 



1701 

wSSIIB GTGGTGAACA CGCCAACTAC 
wSSIID GTGGTGAACA CGCCAACTAC 
wSSIIA GTGGTGAGCA CGCCAACTAC 

1751 

wSSIIB GTTGTCGTCG TGAGCCCGGG 
wSSIID GTTGTCGTGG TGAGCCCGGG 
WSSIIA GTTGTCGTGG TGAGCCCGGG 

1801 

WSSIIB CGGCTGGGGG CTTCACGACA 
wSSXID CGGCTGGGGG CTTCACGACA 
wSSIIA CGGCTGGGGG CTTCACGACA 



1750 

TTCGCCGCCG GCCTGAAGAT GGCGGACCAG 
TTCGCCGCCG GCCTGAAGAT GGCGGACCAG 
TTCGCCGCCG GCCTgAAGAT GgCGGACCAG 

1800 

GTACCTGTGG GAGCTGAAGA CGGTGGAGGG 
GTACCTGTGG GAGCTGAAGA CGGTGGAGGG 
GTACCTGTGG gAGCTCAAGA CGGTGGAgGG 

1850 

TCATACGGCA GAACGACTGG AAGACCCGCG 
TCATACGGCA GAACGACTGG AAGACCCGCG 
TCATACGGCA GAACGACTGG AAGACCCGCG 



1851 

wSSIIB GCATCGTGAA 
wSSIID GCATCGTCAA 
wSSIIA GCATCGTCAA 



CGGCATCGAC AACATGGAGT 

CGGCATCGAC AACATGGAGT 
CGGCATCGAC AACATGGAGT 



1900 

GGAACCCCGA GGTGGACGTC 
GGAACCCCGA GGTGGACGCC 
GGAACCCCGA GGTGGACGTC 



1901 

wSSIIB CACCTCAAGT CGGACGGCTA 

wSSIID CACCTCAAGT CGGACGGCTA 

wSSIIA CACCTCAAGT CGGACGGCTA 



1950 

CACCAACTTC TCCCTGGGGA CGCTGGACTC 
CACCAACTTC TCCCTGAGGA CGCTGGACTC 
CACCAACTTC TCCCTGGGGA CGCTGGACTC 



1951 

wSSIIB CGGCAAGCGG CAGTGCAAGG 
wSSIID CGGCAAGCGG CAGTGCAAGG 
wSSIIA CGGCAAGCGG CAGTGCAAGG 



2000 

AGGCCCTGCA GCGGGAGCTG GGCCTGCAGG 
AGGCCCTGCA GCGCGAGCTG GGCCTGCAGG 
AGGCCCTGCA GCGCGAGCTG GGCCTGCAGG 



2001 

wSSIIB TCCGCGGCGA 
wSSIID TCCGCGCCGA 
wSSIIA TCCGCGCCGA 



CGTGCCGCTG CTCGGCTTCA 
CGTGCCGCTG CTCGGCTTCA 
CGTGCCGCTG CTCGGCTTCA 



2050 

TCGGGCGCCT GGACGGGCAG 
TCGGCCGCCT GGACGGGCAG 
TCGGCCGCCT GGACGGGCAG 



2051 2100 
wSSIIB AAGGGCGTGG AGATCATCGC GGACGCGATG CCCTGGATCG TGAGCCAGGA 
wSSIID AAGGGCGTGG AGATCATCGC GGACGCCATG CCCTGGATCG TGAGCCAGGA 
wSSIIA AAGGGCGTGG AGATCATCGC GGACGCCATG CCCTGGaTCG TGAGCCAGgA 

2101 2150 
wSSIIB CGTGCAGCTG GTCATGCTGG GCACCGGGCG CCACGACCTG GAGGGCATGC 
WSSIID CGTGCAGCTG GTGATGCTGG GCACCGGGCG CCACGACCTG GAGAGCATGC 
WSSIIA CGTGCAGCTG GTCATGCTGG GCACCGGCCG CCACGACCTG gAGAGCATGc 

2151 2200 
WSSIIB TGCGGCACTT CGAGCGGGAG CACCACGACA AGGTGCGCGG GTGGGTGGGG 
WSSIID TGCAGCACTT CGAGCGGGAG CACCACGACA AGGTGCGCGG GTGGGTGGGG 
wSSIIA TgCGGCACTT CGAGCGGGAG CACCACGACA AGGTGCGCGG gTGGGTGGGG 
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2201 

wSSIIB TTCTCCGTGC GGCTGGCGCA 
wSSIID TTCTCCGTGC GCCTGGCGCA 
wSSIIA TTCTCCGTgc GccTGGCGCA 

2251 

wSSIIB CATGCCCTCC CGGTTCGAGC 
wSSIID CATGCCCTCC CGGTTCGTGC 
wSSIIA CATGCCCTCC CGGTTCGAgC 

2301 

wSSIIB CCTACGGCAC CGTCCCCGTC 
WSSIID CCTACGGCAC CGTCCCCGTC 
WSSIIA CCTACGGCAC CGTCCCCGTC 

2351 

wSSIIB GTGCCGCCGT TCGACCCCTT 

wSSIID GTGCCGCCGT TCGACCCCTT 

wSSIIA GTGCCGCCGT TCGACCCCTT 

2401 

wSSIIB CCGCGCAGAG GCGCAGAAGC 
wSSIID CCGCGCCGAG GCGCACAAGC 
wSSIIA CCGCGCCGAG GCGCAcAAGC 

2451 

wSSIIB CCTACCGGGA CTACAAGGAG 
WSSIID CCTACCGAGA CTTCAAGGAG 
wSSIIA CCTACCGGGA CTACAAGGAG 

2501 

wSSIIB TCGCAGGACT TCAGCTGGGA 
WSSIID TCGCAGGACT TCAGCTGGGA 
WSSIIA TCGCAGGACT TCAGCTGGGA 

2551 

wSSIIB CGTCAAGGCC AAGTACCAGT 
wSSIID CGTCAAGGCC AAGTACCAGT 
WSSIIA CcTCAAGGCC AAGTACCAGT 

2601 

wSS I IB CCGCATGCG . ... TGCATGA 
wSSI ID CCGCATGCG . ... TGCATGA 
wSSIIA CCGCATGCGT GCATGcatgA 

2651 

WSSIIB AAGGTGCCAT 

WSSIID AAAGTGCCAT 

wSSIIA AAcGTGCCAT ccttctcgat 

2701 

wSSIIB CAT. .GAGGT GTGTGTGGTT 
wSSIID CAT. .GAGGT GTGTGTGGTT 
wSSIIA CATGAGagGT GTGTGTGGTT 



2250 

CCGGATCACG GCCGGCGCCG ACGCGCTCCT 
CCGGATCACG GCGGGGGCGG ACGCGCTCCT 
CCGGATCACG GCGGGCGCCG ACGCGCTCcT 

2300 

CGTGCGGACT GAACCAGCTC TACGCCATGG 
CGTGCGGGCT GAACCAGCTC TACGCCATGG 
CGTGCGGGTT GAACCAGCTt TACGCCATGG 

2350 

GTGCATGCCG TCGGTGGCCT GAGGGACACC 
GTGCACGCCG TCGGCGGCCT CAGGGACACC 
GTGCACGCCG TCGGCGGGGT GAGGGACACC 

2400 

CAACCACTCC GGGCTCGGGT GGACGTTCGA 
CAACCACTCC GGGCTCGGGT GGACGTTCGA 
CAACCACTCC GGcCTCGGGT GGACGTTCGA 

2450 

TGATCGAGGC GCTCGGGCAC TGCCTCCGCA 
TGATCGAGGC GCTCGGGCAC TGCCTCCGCA 
TGATCGAGGC GCTCGGGCAC TGCCTCCGCA 

2500 

AGCTGGAGGG GGCTCCAGGA GCGCGGCATG 
AGCTGGAGGG CCCTCCAGGA GCGCGGCATG 
AGCTGGAGGG GcCTCCAGGA GCGCGGCATG 

2550 

GCATGCCGCC AAGCTCTACG AGGACGTCCT 
GCACGCCGCC AAGCTCTACG AGGACGTCCT 
GCATGCCGCC AAGCTCTACG AGGACGTCCT 

2600 

GGTGAACGCT AGCTGCTAGC CGGTCCAGCC 
GGTGAACGCT AGCTGCTAGC CGCTCCAGCC 
GGTGAACGCT AGCTGCTAGC CGcTCCAGCC 

2650 

CAGGATGGAA TTGCGCATTG CGCACGCAGG 
CAGGATGGAA CT , . GCATTG CGCACGCAGG 
gAGGgTGGAA cTGCGCATTG CGCcCGCAGG 

2700 

.GGAGCGCCG GCATCCGCGA AGTACAGTGA 
.GGAGCGCCG GCATCCGCGA AGTACAGTGA 
gGGAGCGCCG GCATCCGCGA gGTgCAGTGA 

2750 

GAGACGCTGA TTC C GATCTGGTCC 

GAGACGCTGA TTC C AATCCGGCCC 

GAGACGCTGA TTCCGATCTc gatctGGTCC 
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2751 2800 
WSSIIB GTAGCAGAGT AGAGCGGAGG TAGGGAAGCG CTCCTTGTTA CAGGTATATG 
WSSIID GTAGCAGAGT AGAGCGGAGG TATATGGGAA TCTTAACTTG GTATTGTAAT 
wSSIIA GTAGCAGAGT AGAGCGGAcG TAGGGAAGCG CTCCTTGTTg CAGGTATATG 

2801 2850 

wSSIIB GGAATGTTGT TAACTTGGTA TTGTAATTTG TTATGTTGTG TGCATTATTA 

wSSIID TTGTTATGTT GTGTGCATTA TTACAATGTT GTTACTTATT CTTGTTAAGT 

wSSIIA GGAATGTTGT cAACTTGGTA TTGTAgTTTG cTATGTTGTa TGCgTTATTA 

2851 2900 
WSSIIB CAGAGGGCAA CGATCTGCGC CGGCGCACCG GCCCAACTGT TGGGCCGGTC 
wSSIID CGGAGGCCAA GGGCGAAAGC TAGCTCACAT GTCTGATGGA TGCAAAAAAA 
wSSIIA caatgttgtt acttattctt gtTAAAAAAA AAAAAAAAAA TVAAA 



wSSIIB 
wSSIID 
wSSIIA 



2901 2950 
GCACAGCAGC CGTTGGATCC GACCGCCTGG GCCGTTGGAT CCCACCGAAA 
AAAAAAAAAA AAA 



WSSIIB 
wSSIID 
wSSIIA 



2951 2965 
AAAAT^AAAAA AAAAA 
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WSSIIA 

WSSIIB 

WSSIID 

ZSSIIA 

2SSIIB 

PEASSII 

POTSSII 



WSSIXA. 

WSSIIB 

WSSIID 

ZSSIXA 

ZSSIIB 

PRASSII 

POTSSII 



1 MSSAVASAAS FLAIASA SP-GRSRRRA RVSAPPPHAG AGRL HW PPWPP-qrtA 5 1 

2 ********** + + _ ^ ^t^^trj, *^,|f Tf y + **:rp^t * * * * •* * * * S ★ ★ - * W * 

X ****AV*SS* STF******* ★'^G**--*** ♦*GSS*F*T* *-S*SFAFWA **S**RAPRD ,57 

1 ♦PG*-I*SS+ SAFL*PV**S *♦ R***G S*G*ALRSY* YSGAELRL** ARRG*P*DG* 56 

1 *MIjSL,G*D*T VLP*H*KNLK FTPKL+TLNG --DLAFSKGL GVGRLNCaSV R 49 

10 PVNFIFCDFY VMEN$I*LHS GNQFHPNLPL IiALRPKK LSLIHGSSRE q 57 
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50 

MEMVLRSQSP LCLRSGPVLI FRPTVAGGGG GTQSLLRTTR FARRRVIRCV 

MEMSLWPRSP LCPRSRQPLV V.VRPAGRGG LTQPFLMNGR FTRSRTLRCM 

51 100 
VASPGCPNRK S.RTASPNVK VAAYSNYAPR LLVESSSKKS EHHDSSRHRE 

VASSDPPNRK SRRMVPPQVK VISSRGYTTR LIVEPSNENT EHNNRD . . . E 

101 150 
ETIDTYNGLS GSDAAELTSN RDVEIEVDLQ HISEEELPGK VSINASLGEM 

ETLDTYNALL STETAEWTDN REAE TAKADSSQN AIiSSSIIGEV 

151 200 
ETVDEAEVEE DKFEVDTSGI VLRNVAVREV DPKDEHNAKD VFWDSSGTA 

DVADEDILAA DLTVYSLSSV MKKEVDAADK ARVKE . . . .D AFELDXASTT 

201 

PDNAAVEEW DEAEVEEDMV DVDILGLDLN NATIEEIDLM 

LRSVIVDVMD HXWDCQETLR SVIVDVMDHN GTVQETLRSV IVDVMDDAAD 

251 300 
DSPGNASSGR TYGGVDELGE LPSTSVDCIA INGKRRSLKP KPLPIVRFQE 

KARVEEDVFE LDLSGNISSS ATTVELDAVD EVGPVQDKFE ATSSGNVSNS 

301 350 
QEQIVLSIVD EEGLIASSCE . EGQPWDYD KQEENSTAFD EQKQLTDDFP 

ATVREVDASD EAGNDQGIFR ADLSGNVFSS STTVEVGAVD EAGSIKDRFE 

351 400 
EEGISIVHFP EPKNDIVGSS KFLEQKQELD GSYKQDRSTT GliHEQDQSW 

TDSSGNVSTS APMWDAIDET VADQDTFEAD LSGNASSCAT YREVDDWDE 

401 450 
SSHGQDKSIV GVPQQIQYND QSIAGSHRQD QSIAGAPEQI QSVAGYIKPN 

— MDVPFPLH 

TRSEEETFAM DL FAS ESGHEKHMAV DYVGEATDEE ETYQQQYPVP 
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451 

QSIVGSCKQH 
RSLSCTSVSN 
SSFSMWDKAI 



ELIIPEPKKI 
AITHLKIKPI 
AKTGVSLNPE 



ESIISYNEID 
LGFVSHGTTS 
LRLVRVEE . . 



QSIVGSH.KQ 
XiSVQSSSWRK 
QGKVNFSDKK 



250 

EEALLENFDV 



500 

DKSWSVPEQ 
DGMVTGVSFS 
DLSIDDLPGQ 



501 550 
IQSIVSHSKP NQSTVDSYRQ AESIIGVPEK VQSITSYDKL DQSIVGSL.KQ 
ICANFSGRRR RKVSTPRSQG SSPKGFVPRK PSGMSTQRKV QKS.NGDKES 
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wheatSSIII NQSIIGSYKQ DKSIADVAGP TQSIFGSSKQ HRSIVAFPKQ 



maizeSSIII 
potatoSSIII 
wheatSSIII 



551 600 
DEPIISVPEK IQSIVHYTKP NQSIVGLPKQ QQS IVHIVEP KQSIDGFPKQ 
KSTSTSKESE ISNQKTVEAR VETSDDDTKG WRDHKFLED EDEINGSTKS 
NQSIVSVTEQ KQSIVGFRSQ DLSAVSLPKQ NVPIVGYVER GSNXKQVPW 



601 650 
maizeSSIII .DLSIVGISN EFQTKQLATV GTHDGLLMKG VEAKETSQKT EGDTLQATFN 
potatoSSIII ISMSPVRVSS QFVESEETGG DDKDAVKLNK SKRSEES . . . .GFIIDSVIR 
WheatSSIII DRQDALYVNG LEA KEGDHTSEKT DEDALHVKFN 



maizeSSIII 
potatoSSIII 
WheatSSIII 



651 700 
VDNLSQKQEG LTKEADEITI lEKINDEDLV MIEEQKSIAM NEEQTIVTEE 
EQSGSQGETN ASSKGSH.AV GTKLYEILQV DVEPQQ . . . L KENNAGNVEY 
VDNVLRKHQA DRTQAVEKKT WKKVDEEHLY MTEHQKRAA. . EGQMWNED 



maizeSSIII 
potatoSSIII 
wheatSSIII 



701 750 
DIPMT^KVEIG IDKAKFL.HL LSEEESSWDE NEVGIIEADE QYEVDETSMS 
KGPVASKLLE ITKASDVEHT ESNEIDDLDT NSFFKSDLIE EDEPLAAGTV 
ELSIT..EIG MGRGDKIQHV LSEEELSWSE DEVQLIEDDG QYEVDETSVS 



maizeSSIII 
potatoSSIII 
wheatSSIII 



751 800 
. . TEQDIQES PNDDLDPQAL WSMLQELAEK NYSLGNKLFT YPDVLKADST 
..ETGDSSLN LRLEMEANLR RQAIERLAEE NLLQGIRLFC FPEWKPDED 
VNVEQDIQGS PQDWDPQAL KVMLQELAEK NYSMRNKLFV FPEWKADSV 



801 850 

maizeSSIII IDLYFNRDLS AVANEPDVLI KGAFNGWKWR FFTEKLHKSE LAGDWWCCKL 

potatoSSIII VEIFLNRGLS TLKNESDVLI MGAFNEWRYR SFTTRLTETH LNGDWWSCKI 

wheatSSIII IDLYLNRDLT ALANEPDWI KGAFNGWKWR LFTERLHKSD LGGVWWSCKL 

851 900 
maizeSSIII YIPKQAYRMD FVFFNGHTVY ENNNNNDFVI QIESTMDENL FEDFLAEEKQ 
potatoSSIII HVPKEAYRAD FVFFNGQDVY DNNDGNDFSI TVKGGMQIID FENFLLEEKW 
WheatSSIII YIPKEAYRLD FVFFNGRTVY ENNGNNDFCI GIEGTMNEDL FEDFLVKEKQ 

901 950 
maizeSSIII RELENIxANEE AERRRQTDEQ RRMEEERAAD KADRVQAKVE VETKKNKLCN 
potatoSSIII REQEKLAKEQ AERERLAEEQ RRIEAEKAEI EADRAQAKEE AAKKKKVLRE 
wheatSSIII RELEKLAMEE AERRTQTEEQ RRRKEARAAD EAVRAQAKAE lEIKKKKLQS 

951 1000 
maizeSSIII VLGLARAPYD NLWYIEPITT GQEATVRLYY NINSRPLVHS TEIWMHGGYN 
potatoSSIII LMVKATKTRD ITWYIEPSEF KCEDKVRLYY NKSSGPLSHA KDLWIHGGYN 
WheatSSIII MLSLARTCVD NLWYIEASTD TRGDTIRLYY NRNSRPLAHS TEIWMHGGYN 

1001 . 1050 

maizeSSIII NWIDGLSFAE RLVHHHDKDC DWWFADWVP ERTYVLDWVF ADGPPGSARN 

potatoSSIII NWKDGLSIVK KLVKSERIDG DWWYTEWIP DQALFLDWVF ADGPPKHAIA 

WheatSSIII NWXDGLSIVE SFVKCNDKDG DWWYADVIPP EKALVLDWVF ADGPAGNARN 



1051 1100 

maizeSSIII YDNNGGHDFH ATLP.NNMTE EEYWMEEEQR lYTRLQQERR EREEAIKRKA 

potatoSSIII YDNNHRQDFH AIVP.NHIPE ELYWVEEEHQ IFKTLQEERR LREAAMRAKV 

WheatSSIII YDNNARQDFH AILPNNNVTE EGFWAQEEQN lYTRLIiQERR EKEETMKRKA 
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ERNAKMKAEM KEKTMRMFLV SQKHIVYTE . PLEIHAGTTI DVLYNPSNTV 
EKTALLKTET KERTMKSFLL SQKHWYTE . PLDIQAGSSV TVYYNPANTV 
ERSANIKAEM KAKTMRRFLL SQKHIVYTRT XLKYVPGTTV DVLYNPSNTV 



maizeSSIII 
potatoSSIII 
wheatSSIII 



maizeSSIII 
potatoSSIII 
wheatSSIII 



maizeSSIII 
potatoSSIII 
wheatSSIII 



maizeSSIII 
potatoSSIII 
WheatSSIII 



maizeSSIII 
potatoSSIII 
WheatSSIII 



maizeSSIII 
potatoSSIII 
wheatSSIII 



maizeSSIII 
potatoSSIII 
wheatSSIII 



maizeSSIII 
potatoSSIII 
WheatSSIII 



maizeSSIII 
potatoSSIII 
WheatSSIII 



maizeSSIII 
potatoSSIII 
WheatSSIII 



maizeSSIII 
potatoSSIII 
WheatSSIII 



maizeSSIII 
potatoSSIII 
WheatSSIII 
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LTGKPEVWFR 
LNGKPEIWFR 
LNGKSEGWFR 



CSFNRWMYPG 
CS FNRWTHRL 
CSFNLWMHSS 



GVLPPQKMVQ 
GPLPPQKMSP 
GALPPQKMVK 



AENGSHLKAT 
AENGTHVRAT 
SGDGPLLKAT 



1200 
VYVPRDAYMM 
VKVPLDAYMM 
VDVPPDAYMM 



1201 1250 
DFVFSESEEG GIYDNRNGLD YHIPVFGSIA KEPPMHIVHI AVEMAPIAKV 
DFVFSEREDG GIFDNKSGMD YHIPVFGGVA KEPPMHIVHI AVEMAPIAKV 
DFVFSEWEED GIYDNRNGMD YHIPVSDSIE TENYMRIIHI AVEMAPVAKV 

1251 1300 

GGLGDWTSL SRAVQDLGHN VEVILPKYGC LNLSNVKNLQ IHQSFSWGGS 

GGLGDWTSL SRAVQDLNHN VDIILPKYDC LKMNNVKDFR FHKNYFWGGT 

GGLGDWTSL SRAIQDLGHT VEVILPKYDC LNQSSVKDLH LYQSFSWGGT 

1301 1350 

EINVWRGLVE GLCVYFLEPQ NGMFGVGYVY G.RDDDRRFG FFCRSALEFL 

EIKVWFGKVE GLSVYFLEPQ NGLFSKGCVY GCSNDGERFG FFCHAALEFL 

EIKVWVGRVE DLTVYFLEPQ NGMFGVGCVY G.RNDDRRFG FFCHSALEFI 

1351 1400 

LQSGSSPNII HCHDWSSAPV AWLHKENYAK SSLANARWF TIHNLEFGAH 

LQGGFSPDII HCHDWSSAPV AWLFKEQYTH YGLSKSRIVF TIHNLEFGAD 

LQNEFSPHII HCHDWSSAPV AWLYKEHYSQ SRMASTRWF TIHNLEFGAH 

1401 1450 
HIGKAMRYCD KATTVSNTYS KEVSGHGAIV PHLGKFYGIL NGIDPDIWDP 
LIGRAMTNAD KATTVSPTYS QEVSGNPVIA PHLHKFHGIV NGIDPDIWDP 
YIGKAMTYCD KATTVSPTYS RDVAGHGAIA PHREKFYGIL NGIDPDIWDP 

1451 1500 
YNDNFIPVHY TCENWEGKR AAKRALQQKF GLQQIDVPW GIVTRLTAQK 
LNDKFIPIPY TSENWEGKT AAKEALQRKL GLKQADLPLV GIITRLTHQK 
YTDNFIPVPY TCENWEGKX AAKRALQQKF GLQQTDVPIV GIITRLTAQK 

1501 1550 
GIHLIKHAIH RTLERNGQW LLGSAPDSRI QADFVNLANT LHGVNHGQVR 
GIHLIKHAIW RTLERNGQW LLGSAPDPRV QNNFVNLANQ LHSKYNDRAR 
GIHLIKHAIH RTLESNGQW LLGSAPDHRI QGDFCRLADA LHGVYHGRVK 

1551 1600 

LSLTYDEPLS HLIYAGSDFI LVPSIFEPCG LTQLVAMRYG TIPIVRKTGG 

LCLTYDEPLS HLIYAGADFI LVPSIFEPCG LTQLTAMRYG SIPWRKTGG 

LVLTYDEPLS HLIYAGSDFI IVPSIFEPCG LTQLVAMRYG SIPIVRKTGG 

1601 1650 
LFDTVFDVDN DKERARDRGL EPNGFSFDGA DSNGVDYALN RAISAWFDAR 
LYDTVFDVDH DKERAQQCGL EPNGFSFDGA DAGGVDYALN RALSAWYDGR 
LXDTVFDVDN DKDRARSLGL EPNGFSFDGA DSNGVDYALN RAIGAWFDAR 

1651 1686 

SWFHSLCKRV MEQDWSWNRP ALDYIELYRS ASKL*- 

DWFNSLCKQV MEQDWSWNRP ALDYLELYHA ARKLE* 

DWFHSLCKRV MEQDWSWNRP ALDYIELYHA ARKF*- 
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